YSEUM LIBRARY 2/2 MAY 6 1948 


December 1947). Volume | Part Ill 





MENDEL | DARWIN MORGAN 


PERE YT 


An International Journal of Genetics 





JOHANNSEN 
AOTIAVA 


Edited by 
Cyril D Darlington Dsc FRs 


London 


SNAVNNIT 


Ronald A Fisher ScD FRS 
Cambridge 


GALTON 


in collaboration with 


G W Beadle T Caspersson 


Pasadena Stockholm 


Th Dobzhansky B Ephrussi 
New York Paris 
@ Winge 
Copenhagen 


SAIYA AG 


WEISMANN 


Oliver and Boyd 
London and Edinburgh 


VILMORIN 
INVZNV11VdS 

















BOVERI BATESON WILSON 








INFORMATION FOR CONTRIBUTORS 


Heredity publishes three times a year original articles of 
interest to research workers, teachers, and students in the fields © 
of experimental breeding, cytology, statistical and biochemical ~ 
genetics and evolutionary theory. New research, history, © 
review and criticism are accepted. 


Manuscripts should be prepared so as to conform with the — 
usage in current numbers of the Journal, especially with © 
regard to Illustrations, References, Tables, and Summaries. 
The address of the Author and the Date of Receipt of the 
Manuscript will be printed under the Title which should be 
scrupulously abbreviated. The following abbreviations for 
References will be used :— 

CR. : : . Comptes rendus 

F. Genet. i : . Journal of Genetics 

P. Int. Gen. C. (7) . . Proceedings of the 7th International Genetics Congress 
P.N.A.S. ‘ : . Proceedings of the National Academy of Sciences, Washington 
oe Oe | a ; . Proceedings of the Royal Society of London, Series B 
P.R.S. Edin., B ‘ . Proceedings of the Royal Society of Edinburgh, Series B 
KALA. : : . RKeitschrift fiir induktive Abstammungs- und Vererbungslehre 

Text Ficures must be (1) suitable for line-block reproduc- 
tion, (2) lettered in pencil, (3) supplied with legends and 
(4) separated from the text but allocated to sections of the text. 


PxiaTEs should be mounted to fit or exceed the whole size 
of the page (10” x64”) if possible. . Half-tone plates can be 
bled if wished. Colour plates will be accepted only if the 
author bears half the cost of reproduction. 


Proor is supplied once, in Page form, together with an 
Order Form for Off-prints. Excessive corrections will be 
chargeable to the Author. 


OrF-PRINTS are free to the number of 50. Additional 
copies will be supplied at cost if ordered before the issue is 
printed. 


EpITors :— 


Dr C. D. DarLincTon Prof. R. A. FisHER 
John Innes Institution Whittingehame Lodge 
London, S.W. 19 Cambridge 


Single Numbers, 14/- or $3 net. Annual Subscription, 40/- or $8.50 net 


Printed and Published in Great Britain by OLiver AND Boyp Ltp. 
Edinburgh: Tweeddale Court. London: 98 Great Russell Street, W.C, 





HEREDITY VOLUME! PART Ill DECEMBER 1947 





HEREDITARY MALFORMATIONS IN MAN * 


TAGE KEMP 
University Institute for Human Genetics, Copenhagen 


Received 23.v.47 


SERIOUS congenital physical malformations, especially skeletal, occur 
in at least one per cent. of all newborn children. On account of the 
great mortality for these affections, they are less frequent in the total 
population. A majority of these congenital lesions are hereditary, 
but this does not apply to all of them. Some of them, for instance, 
may be sequele of injury early in feetal life ; for example, deformities 
from constriction by amniotic cords (fig. 1), double-monster (fig. 2), 
acardia in twin pregnancy (fig. 3), or they may be due to serological 
disharmony between the mother and the foetus (e.g. in rhesus-nega- 
tive mothers), to infections (e.g. with rubella or toxoplasmosis) or to 
intoxications during pregnancy. 

It is the hereditary malformations that I shall mention here— 
not in a systematic review, but those conditions investigated in the 
Institute of Human Genetics in Copenhagen in recent years—which 
have proved to be of particular genetic interest. For, as a matter of 
fact, these physical malformations constitute the group of hereditary 
affections that are most suitable for genetic investigation. They are 
congenital, and they are demonstrable throughout life; further, 
they are easy to recognise from their description by laymen. Their 
occurrence can therefore be followed back through many generations, 
and pedigrees of their inheritance are relatively easy to establish. 
Indeed, the first lesion for which Mendelian inheritance was demon- 
strated, was a physical malformation—namely, brachydactylia— 
which Farabee in 1905 described as a dominant inherited lesion in 
three large families in which altogether 99 members presented the 
malformation. 

It has been practicable later, through the study of other bodily 
malformations, to show that the laws of heredity which we know 
from experimental genetics apply correspondingly also to hereditary 
_ lesions in man. 

As an example of the numerous congenital malformations of the 
skull a family with craniofacial dysostosis (Crouzon’s disease) is shown 


* A paper read before The Genetical Society in London, 25th November 1946. 
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(plate II, fig. 5, and pedigree 1). A woman with Crouzon’s disease, 
without any known cases of the disease among her ascendants, had 
in all five children in two different marriages—by her first husband 
one normal daughter and one daughter with Crouzon’s disease, by 
her second husband one son with Crouzon’s disease and a pair of 
dizygotic twins, a normal girl and a boy with Crouzon’s disease. 
The pedigree confirms the view advanced by Apert in 1907, that this 
disease may arise from mutation as a dominant hereditary affection. 
So far, however, the mutation rate for this rather rare disease has not 
been calculated. I should consider it impossible in a small country 
like Denmark to collect a sufficient number of cases for this purpose. 

Harelip and cleft palate are very common malformations, and a 
hereditary disposition is the essential etiologic factor. There are, as 
shown by Fogh-Andersen, two different malformations with no genetic 
connection, viz. (1) harelip with or without associated cleft palate 
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Pedigree 1. Craniofacial dysostosis. The disease has probably arisen through mutation 
in number 1. 











(ha. (+ cl. pa.)) and (2) isolated cleft palate (cl. pa.). The frequency 
at birth of these malformations in Denmark is 1-50 per mille (+ 
o-11 per mille), viz. 1:1 per mille ha. (+cl. pa.) and 0-4 per mille 
isolated cl. pa. Twenty-five per cent. of the children born with harelip 
or cleft palate die within the first year of their birth, and 10 per cent. 
of them have at birth severe associated malformations. Among the 
relatives of patients with harelip and cleft palate, however, no 
definitely increased occurrence of other affections is found. 

Fogh-Andersen has determined the empirical figures of genetic 
prognosis for the different groups of relatives by an enumeration 
according to the propositus method, The figures are :— 


for ha. (+ cl. pa.) : 


Frequency among children of ha. (+ cl. pa.) patients 2-0 per cent. 
Frequency among sibs, when parents normal . — . aa 
Frequency among sibs, when one parent has ha. 

(+ cl. pa.) ‘ ‘ , F j , or 
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for isolated cl. pa. : 


Frequency among children of cl. pa. patients, if 
hereditarily disposed (otherwise probably far 


smaller) . : . 7°0 per cent. 
Frequency among sibs, when parents normal and if 

hereditarily disposed ‘ . 12°0 ii 
Frequency among sibs without known hereditary 

disposition ‘ ; . 38 _ 
Frequency among sibs, when one parent has cl. pa. . 17°0 os 


The twin material permits of no exact determination of the degree 
of manifestation, but it seems to be much below 50 per cent. PI. II, 
figs. 6-9, shows 3 pairs of identical twins with harelip and cleft palate ; 
in figs. 6 and 7 concordant and on left side in both twins (congruent) ; 
in fig. 8 concordant and reversed asymmetrical (mirror-imaging) ; 
and in fig. 9 discordant. 

Most cases of ha. (+ cl. pa.) must be supposed to be hereditary, 
whereas in isolated cl. pa. there is probably a considerable admixture 
of non-hereditary cases. The most likely manner of inheritance for 
ha. (+ cl. pa.) is that of “ conditioned dominance” with sex limitation 
to males and considerably less manifestation of the heterozygote than 
of the homozygote, so that the affection is in some families apparently 
dominant (cf. pedigree 2), more often, however, apparently of a 
recessive character (cf. pedigree 3). 

These new results concerning the inheritance of harelip and cleft 
palate have been achieved by Fogh-Andersen. He examined a 
sample representative of the 4000 harelip and cleft palate patients in 
Denmark, including 500 ha. (+cl. pa.) patients and 200 cl. pa. 
patients and their families, using the statistical genealogical method 
known as Weinberg’s ‘“‘ Proband ” (propositi) method. When in- 
vestigating the inheritance of more rare malformations it is not 
necessary to limit the study to a sample, it may be possible to collect 
all cases of the malformation in question living in the country. Trier 
Morch’s study of the chondrodystrophic dwarfs in Denmark may 
exemplify that. 

Trier Merch (1941) examined all living (86) and 22 recently 
deceased chondrodystrophics in Denmark, and in addition a Swedish 
family. He concludes that all cases of chondrodystrophy are of 
genotypical nature, transmitted as a dominant character, arising 
rather frequently through mutation. When both parents are normal, 
chondrodystrophy will appear once in every 10,000-12,000 births. 
Pedigree 4 shows a case of chondrodystrophy probably arisen through 
mutation (the patient to the left in pl. III, fig. 10) ; his ascendants 
and all known members of the family were normal, and no con- 
Sanguinity occurred in the pedigree. Normal mothers and fathers 
of chondrodystrophics are older than those of non-chondrodystrophics. 
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Quite different from chondrodystrophy is osteochondrodystrophy, 
Silfverskidld-Morquio’s disease, which is characterised by multiple 
disturbances in the anlage of the epiphyses, irregular ossification in 
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Pedigree 2. Harelip and cleft palate through 3 generations (pedigrees 1-3 from Fogh- 
Andersen). 
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Pedigree 3. Harelip and cleft palate as a recessive character. 
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Pedigree 4, Chondrodystrophy probably arisen through mutation in number 7 (from 
Trier Morch). 


the primary and secondary centres of ossification, and decreased 
growth. Recent investigations (Helweg-Larsen and Trier Merch) 
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have, however, shown that osteochondrodystrophy has to be divided 
into two forms :— 

(1) Morquio’s syndrome, which is inherited as a monogenic 
recessive, and in which the skeletal changes are localised largely in 
the vertebral column (fig. 11 and pedigree 5). 

(2) Silfverskidld’s syndrome, which is transmitted simply as a 
monogenic dominant character, and in which the skeletal changes 
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Pedigree 5. Osteochondrodystrophy (Morquio’s syndrome) as a recessive character, 


Family IV. 
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Pedigree 6. Osteochondrodystrophy (Silfverskiéld’s syndrome) as a dominant character. 
(Pedigrees 5 and 6 from Helweg-Larsen and Trier Morch). 


are localised particularly in the extremities (fig. 12 and pedigree 6). 
So far it has not been possible to calculate the mutation rate for 
Silfverskidld’s disease. 

Arachnodactyly shows exactly the reversed symptoms of chondro- 
dystrophy—long, slender acuminated extremities with spider-like 
fingers and toes and slim body, and often combined with luxatio 
lentis (Marfan’s syndrome, pl. IV, fig. 13). Marfan’s syndrome is 
inherited as a dominant character (pedigree 7); the disease is, 
however, very rare, and it has not therefozc been possible so far 
to calculate the mutation rate. 
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Another hereditary disease presenting generalised malformations 
of the bone is osteogenesis imperfecta (osteopsatyrosis, fragility of 
bones, Eddowe’s disease). Fig. 14 shows a very severe case, which 
occurred isolated in an otherwise 
normal family. The patient was 
a complete invalid during his 
whole lifetime. He had to stay 
in a hospital bedridden for many 
a long year ; it is the reason why 
he did not propagate. Probably 
his disease had arisen through 
mutation. In fig. 15 a patient 
suffering from a less severe but 
yet a rather serious case of osteo- 
genesis is seen; her family was 
normal but she herself had borne 
a child with osteogenesis. The 
patient on fig. 16 had a still milder 
case of osteogenesis although he 
was an invalid; his mother, and 
her father and sister, suffered from osteogenesis. Apparently the 
disease was inherited as a dominant in this family. Thus the gene 
causing osteogenesis imperfecta varies considerably in manifestation, 
and again it has not been possible to calculate its mutation rate. 

Eskelund and Bartels (1941) described a pair of identical male 
twins with spina bifida, born in Viborg, Jutland, presenting a menin- 
gocele in the sacral region, eventration, abnormal development of the 
umbilical vessels, aplasia vesice, diastasis symphyseos, deformation 
of the columna vertebralis and pedes equino-varus (pl. I, fig. 4) ; 
this picture has been described by Aschoff as spina bifida umbalis. 

Hindse-Nielsen (1938) investigated 124 families in Denmark with 
at least one case of craniorachischisis. Consanguineous marriage was 
found only three times among the 124 couples of parents. The greatest 
number showing the deformity in the same family was 7, distributed 
among three groups of sisters and brothers. Among 12,550 persons 
belonging to these 124 families, Hindse-Nielsen found 50 extra cases 
of this deformity ; in the 124 families were 131 sibships and these 
131 propositi had in all 548 brothers and sisters ; 28 (5:1 per cent.) 
of these siblings had spina bifida aperta, a part of craniorachischisis. 
It is consequently probable that some hereditary component must be 
reckoned with in the genesis of this malformation ; it is not, however, 
a question of simple dominant or recessive heredity. 

Some of the most severe malformations of the extremities are cleft 
hands and feet and other forms of ectrodactylia, and congenital 
amputations. 

In Denmark, Birch-Jensen investigated 25-30 families with one 
or more cases of cleft hands. About go per cent. of the cases were 
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Pedigree 7. Arachnodactylia as a dominant 
character (from Filskov). 
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isolated examples in the family (fig. 21) ; in about 10 per cent., 
however, the malformation was inherited as a dominant character 
through several generations (pedigree 8 and pl. V, figs. 17-20). 
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Pedigree 8. Cleft hands and cleft feet as a dominant character. 























Still less dependent on hereditary factors are the congenital 
amputations of the extremities. Birch-Jensen examined about 140 
families with at least one case of congenital amputation and nearly 
every case appeared isolated as a single case in the family (fig. 22) ; 
in about 10 of the families a malformation was found in more or less 
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Pedigree 9. Congenital amputation (recessive ?). (Pedigrees 8 and 9 from Birch-Jensen). 
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distant relatives. Rather exceptional is the family demonstrated in 
pedigree 9, and figs. 23-24, showing congenital amputation in a 
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brother and a sister. 142 patients with congenital amputation had 
altogether borne 76 children, and none of these 76 children suffered 
from congenital amputations or other malformations. The malformed 
sister in pedigree g married a man with a congenital amputation 
and still they had a normal daughter and a normal granddaughter. 
The malformed brother had six normal descendants in two 
generations. 


The more recent Danish studies on hereditary physical malforma- 
tions and the way in which these lesions are transmitted have now 
been mentioned to illustrate the methods we use in the Institute of 
Human Genetics in Copenhagen. 

In our genetic register which, as far as possible, is a complete 
card index covering all the more serious hereditary affections 
observed in Denmark, we have the starting-point for more thorough 
investigation into particular fields—in many instances, of the in- 
heritance of a special lesion. The procedure then is as follows: The 
physician, who is trained as a specialist in the field concerned, makes 
a thorough investigation of the individual patients with the lesion 
in question and of their families, partly on the basis of hospital records 
and other documentary material, partly by travelling about, visiting 
and examining the individual patients in their homes. 


Among similar investigations undertaken with a view to other groups of diseases 
and carried out by a number of specialists attached to our institute—studies that 
have been completed already or will be concluded in the near future—the following 
lesions may be mentioned :— 

Medical diseases. Hemophilia, hypertension, nephrosclerosis, diabetes mellitus, 
gastric and duodenal ulcers, bronchial asthma, situs inversus of the viscera, thyroid 
diseases. 

Cancer. Cancer of the breast, cancer of the uterus, double cancer, leukemia. 

Nervous diseases. Myotonia, progressive muscular dystrophy, spinal muscular 
atrophy, Huntington’s chorea, epilepsy. 

Mental diseases. Schizophrenia, tuberous sclerosis, mongoloid idiocy. 

Skin lesions. Epidermiolysis bullosa, anhidrosis and neurolabyrinthitis, von 
Recklinghausen’s disease. 

Eye lesions. Aniridia, coloboma, buphthalmia, glaucoma, retinitis pigmentosa, 
Leber’s optic atrophy, angiomatosis of the retina. 

Ear lesions. Sporadic recessive deaf-mutism, heredolabyrinthic deaf-mutism, 
otosclerosis. 

Besides these investigations of the inheritance of the individual diseases the 
institute is occupied also with other tasks such as, for instance, investigations of 
twins and of certain definite groups of the population, e.g. gipsies, prostitutes, some 
asocial elements, and adopted children. In this way the registration of the 
hereditary diseases in Denmark is supplemented. 

On the basis of the experiences gained in these investigations, the institute 
exercises its hereditary hygienic or eugenic activity as adviser on questions of 
sterilisation, induced abortion, marriage, adoption, and of special relief. The 
material of the registration department has proved to be of much use to this activity. 

In the department for experimental pathological genetics investigations on 
cytology and experimental studies on hereditary diseases occurring in the common 
experimental animals are carried out. 
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The investigations on hereditary malformations mentioned in this 
paper exemplify the means and methods used in our institute for 
studying the various groups of hereditary diseases. 


SUMMARY 


Investigations on the inheritance of various congenital malforma- 
tions are described, viz. craniofacial dysostosis, harelip and cleft 
palate, chondrodystrophy, osteochondrodystrophy, arachnodactylia, 
Marfan’s syndrome, osteogenesis imperfecta, craniorachischisis, cleft 
hands and feet, ectrodactylia and congenital amputations. 

Through the study of these malformations it has been possible to 
show that the laws of heredity known from experimental genetics 
apply also to hereditary lesions in man. 
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1. DETERMINACY IN LANGUAGE 


Tue achievement of two centuries of philology has been, first, to 
show the family relationships of languages from which evolutionary 
changes could be inferred and, secondly, to show certain of the laws 
by which these changes take place. Some of these laws are important 
in demonstrating a general principle of determinacy ; such are the 
laws of parallel development, both phonetic as in sound shifts and 
consonantal dissimilation, and grammatical as in loss of gender or 
inflexion. These laws imply the regular relationship of cause and effect 
over great areas and great periods of time. Their scope has naturally, 
although sometimes unduly, impressed linguists with the uniform 
character of linguistic processes throughout mankind. 

There are, however, other laws of development which point to 
differences amongst men. Such are the principles of the individuality 
of speech, the local determination of dialect, and the instability of 
newly transferred or imposed languages. These also demand 
explanation in terms of cause and effect. But the causes of the 
differences, no less than the causes of the similarities, must be partly 
genetic. It is these genetic components whose actions I wish to 
examine. 

2. THE POINT OF ATTACK 


Languages as organised systems of expression establish, by their 
cultural influence, those mating groups or rather barriers between 
groups which, in all sexual organisms, are responsible for the formation 
of races and species. Languages must have an effect in producing 
genetic homogeneity within groups ; that effect must be proportional 
to the time and degree to which the barriers to mating that they set 
up are effective and other barriers are not. Anthropologists and 
politicians have therefore had some excuse for identifying race and 
language. But it has, over most of the world, been a poor excuse 
owing to the obvious instability of language-race combinations in 
man. All great empires from the Roman to the Russian have, in 
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some measure, communicated their own languages to their subject 
peoples. The Romance languages cover that part of the Western 
Empire which has not been submerged by later empires. And the 
Western Empire was latterly only one-thousandth part descended 
from the old republic. Language therefore represents the top stratum 
of conquests ; and many of these conquests have depended on cultural 
rather than on numerical superiority. 

If transferences of language from one people to another had been, 
or could be, accomplished without change, there would be no ground 
for inventing or seeking a genetic component of language. But in 
fact the transferred or imposed language is always altered when the 
imposing force is removed. How are we to examine these alterations 
and identify their genetic components? The first step must be to 
sort out changes in language itself into elements much as, in genetic 
experiments, we sort out changes of phenotype into the effects of 
heredity and environment. For this purpose language may be taken 
as consisting of three elements : the first is the scarcely changing basis, 
the stems of words, whose constancy is used in tracing the genealogy 
of languages. The second is a slowly changing part, the grammar, 
whose large changes follow a cycle merely differing in speed in 
different members of the Indo-European family and _ therefore 
providing poor evidence of genetic control. The third is the most 
rapidly changing part, that of sound, whose changes follow different 
lines in every language and even dialect. Phonetics is evidently the 
most promising starting-point. 

Sound changes are of two kinds. The first may be described as 
changes of apportionment such as are covered by shifts and dissimilations. 
They often have, like grammatical changes, a cyclical and hence 
mechanical character depending on one another and on accent 
changes ; they are therefore again likely to be most indirectly related 
to the genetic conditions. The second kind of sound change is found 
in what we may call complement, or the total sound equipment of a 
people. As a question of complement we may probably include 
the existence of special sound combinations as well as of simple sounds 
in particular positions, such as the initial R lost in Basque and the 
initial and inter-vocal P lost in Celtic languages. Within the 
European languages there are certain clear divergences in complement, 
as in H, TH and KH, while between them and more remote languages 
much greater differences occur. Change of complement can scarcely 
depend on fortuitous mechanical conditions. It is, on the other hand, 
most likely to depend on the inherent capacities of individuals to make 
the sounds concerned. It is with changes of complement, therefore, 
that we ought to begin our investigation. 

It is, further, in Europe that we have the best chance of establishing 
our case, on account of our much greater knowledge of the present 
and past relationships of the peoples and languages of this region than 
of any other. 
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3. GENETIC PREMISES 


Before proceeding any further we must consider what differences, 
on grounds of our present knowledge of genetics, we are likely to find 
and how we may have to measure them. In the light of historical 
and contemporary observations, genetics would lead us to assume that 
we have in all parts of Europe a mixture of individuals of genotypes 
whose differences, in part adaptive to local geographical and cultural 
conditions and in part accidental, fall into two types of spatial 
variation : (i) clinal, to use Huxley’s expression, that is, showing 
statistical gradients in respect of the proportions of individuals with 
contrasting characters ; (ii) éribal or broken up into discontinuous 
groups by barriers to mating. 

These conditions seem unfavourable for discovering any genetic 
influence on speech. Languages are obviously more homogeneous 
than peoples. In adopting a language, however, the individual 
subordinates himself to the community with which he wishes to 
communicate. The language must therefore be determined by the 
community whose genetic properties will be measurable statistically 
from individuals. We shall return to the statistical method later as a 
means of measuring relationship. First, however, we must consider 
the direct mechanism in the individual. 

The mechanism concerned is the apparatus of voice production. 
It shows the highest degree of variation in man. What has hitherto 
been a personal observation can now, following the work of Luchsinger 
and Tarnéczy, be put in ascientific form. Although family resemblances 
are high, no two of us, apart from identical twins, are alike in voice. 
Some differences depend on hard external structure ; others, like 
Sturtevant’s difference in tongue rolling, no doubt depend on less 
obvious muscular and nervous control. Now it is a principle that 
all variable characters of a species (apart from those like sex and caste 
which are co-operative) will also vary absolutely or relatively, as 
between its subordinate endogamous groups, its races or tribes. There 
is therefore no reason to doubt that the observed differences in capacity 
for sound production amongst different races and nations of man have, 
in part, a genetic basis. 

Such differences in capacity need be only very slight. They need 
merely suffice to establish preferences on a principle of economy or 
ease in speech which expresses itself in many linguistic rules. For 
example, by the principle of consonantal dissimilation one consonant 
of a group is altered to make the group as a whole easier to articulate. 
Meillet has pointed out the uniformity of the changes that result. 
This uniformity arises from a uniform difficulty for one people. Wha: 
we are now concerned with are difficulties greater for one people than 
for another, difficulties which will lead to preferences in one and not 
in another, and hence to divergence. 
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Thus we may explain, for example, the failure of the tribe of 
Ephraim to pronounce the word shibboleth correctly. That marginal 
tribe, we must assume, differed from its neighbour of Gilead in such 
a way that SH was not impossible but more difficult to articulate 
than S. The cause, whether the width of the tongue or the narrowness 
of the mouth, does not concern us. The consequence, the sound of 
SH being converted to S in their dialect, so that they “ could not 
frame to pronounce it right ” (Judges, xii, 6, A.V.) is what matters. 

Many will object at this point that education will overcome these 
differences or difficulties. For a sufficiently insensitive ear this can 
appear to be true. But in fact the reverse argument is more important. 
The force of unconscious imitation is the most powerful discipline 
known to free peoples. We see this best in slave-owning societies, 
such as that of the southern United States, where the owners and 
the slaves, genetically dissimilar, by imitating one another, have 
produced in the course of 300 years a convenient intermediate 
compromise. By seeking their lowest common denominator, two 
non-combining groups in one community can thus reduce (but not 
abolish) the appearance of those limitations of speech which are 
inherent in each. But nearly all languages are produced by com- 
munities composed of a single combining group. Imitation within 
communities will then be between genetically similar individuals and 
will therefore serve to magnify differences between communities 
rather than reduce them. A slight, and indeed only statistically 
identifiable, genetic difference will establish a linguistic difference 
and, secondarily or mechanically, a linguistic divergence. 

Genetic limitations and preferences would be expected to express 
themselves at all three levels, the individual, the group and the race 
or nation. The most direct expression of genetic character is seen of 
course in the individual. Impediments of speech are universal in 
infants and different for every infant in each family. Most of these 
impediments are overcome before maturity but a few remain in the 
adult. Particular impediments are characteristic of particular peoples 
speaking particular languages, their own or others. And these are 
found to affect retreating sounds, those for example, like R and H 
in English or H in Serbo-Croat, which are maintained and have even 
been restored, by the written word. 

The group expresses its genetic character in dialect. In this 
respect dialect stands between the imaginary free speech of an 
individual and the standard or high language of a civilised nation. 
Dialect is most significant genetically : the group who make it are, 
in the first place, as close as we can get to a mating group, a unit of 
genetic recombination ; in the second place, they are a group un- 
restrained by the conservative force of the written word with its 
traditional articulation whose effect we have just noted. The effect 
of inbreeding in promoting the proliferation of dialects is auto- 
catalytic. For the dialects, once formed, prohibit outbreeding ; they 
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do so with equal success in the flat plains of Hungary and in the 
deep valleys of New Guinea. 

The crucial importance of dialect for the genetic understanding of 
language is seen where a single standard language has been imposed 
by a single political and cultural agency on a people sharply divided 
genetically into two groups. Most European countries provide 
examples but the French division is best documented. The transition 
between southern and northern dialects is, as in England, a gradual 
one but, according to Meillet, if a boundary is drawn it has to run 
from Lyons to Bordeaux, not directly, but skirting the northern 
edge of the central massif. What does this speech bounddry mean ? 
It has been a cultural or political barrier at various times, in the days 
of the Langue d’Oc and of the Albigenses as well as of Vichy, but 
always as a result of invasions from beyond the Rhine. Although 
submerged by the political power, at one time of Rome, at another of 
Paris, it is the natural boundary between a fixed native population 
and one which has been subject to mixture with northern invaders. 
It is a genetic boundary. 

When we reach the language level, what interests us is the newly 
imposed language. Its mechanics are revealed by the changes the 
negro slave has made in the different languages of his masters. Con- 
fronted with French his preference asserted itself, whether in rhythm 
or sound, by breaking it down into Creole, a new language. 
Confronted with English and Spanish he merely modified the 
sounds. As Johnson remarks (1926): “the harsh ‘ ts’ sound is dis- 
pleasing to the negro ear as well as troublesome to the negro tongue, 
so it (dat’s) 1s softened to ‘das’. Similarly he generally reduces 
TH to T or D. In both cases it is no doubt the tongue which prefers 
and adopts, and the ear which later approves, the easy course. 

When we take a longer view, we see the imposed language revealing 
genetic action in two ways, by its divergence and by its instability. 
The new speakers change a language to fit their preferences in sound 
production. And when they themselves are genetically mixed, as 
they usually are, they themselves change and the language remains 
unstable for a long time. To the first cause we have to attribute the 
diversification of the Romance languages, and to the second, the 
instability of English or early Latin in contrast to the stability of 
Persian or Polynesian. 


4. THE VIEWS OF THE LINGUIST 


With these genetic principles in mind we can put a proper value 
on what linguists themselves have said. It is only recently that genetics 
has enabled us to distinguish between heredity and environment, or 
between genotype and phenotype. In the absence of either of these 
distinctions the linguist (and the social scientist likewise) has necessarily 
failed to put race and culture, or individual and group, in their proper 
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relationship. His opinions in consequence have been diverse and 
sometimes confused. They may be represented by three conflicting 
examples. 

My first example will be van Ginneken who has both propounded 
and illustrated the principle of genetic determinacy in language. 
His understanding of some of the consequences of hybridisation seems 
to be correct, and also of the temporary effect of a governing class : 
as he says “ naturam expellas furca, tamen usque recurret.” In 
consequence I believe that he intends his “ revivals,” or reciprocal 
changes in sounds, to have the precise socio-genetic interpretation 
that I shall put upon them. But when van Ginneken assumes, on 
the genetic side, polymeric factors, recombination, crossing-over and 
dominance, and, on the social side, a balance of individuals in 
segregation ratios with preferences for particular vowels, linguists 
and geneticists equally will find him hard to follow. He is taking 
over from the early Mendelians a mechanical naiveté, which is 
altogether useless in the field of language. His attack on the problem 
of circular sound changes in the Teutonic languages also seems to 
underestimate the difficulties I have alluded to earlier. On the other 
hand van Ginreken has collected facts which will be of great value 
to the later development of the subject. And he was the first to 
recognise the whole possibilities of the genetic approach to language 
** qui pour l'étude de ’hérédité humaine apportent un trésor inattendu 
de données.” 

The more sophisticated views of Meillet may be summarised as a 
second example by an extract in his own words (1925) :— 


En apprenant une langue nouvelle, les sujets ne perdent pas pour cela leur 
hérédité. . . . On est amené ainsi 4 supposer que les innovations les plus caractér- 
istiques du frangais tiendraient, non seulement a la fagon dont le latina été prononcé 
en Gaule, mais 4 une hérédité d’habitudes acquises par les sujets parlant gaulois. 
Avec cette forme de l’hypothése, les objections que l’on fait souvent a la théorie 
du substrat se résolvent immédiatement. Le passage de u fermé a # sur le domaine 
gallo-roman et en Alsace ne serait pas une survivance immédiate du gaulois, mais 
leffet lointain de certaines habitudes acquises transmises par hérédité. 

Si l’explication—au moins partielle—de la diversité des formes prises sur 
différents domaines par une langue qui se généralise est souvent contestée, c’est 
qu’on envisage les faits d’une maniére trop mécanique: il ne faut pas s’attendre 
a trouver dans la langue généralisée des particularités du ‘‘ substrat ” sous forme 
brute. L’action est complexe et se manifeste sous des formes au premier abord 
peu apparentes. 


In these remarks three arguments are important :— 


(i) The determinacy of development of language can be due 
only to the action of heredity. 

(ii) This action is a long term effect operating on complex 
principles of its own and not by simple mechanical transference as 
many have supposed. 

(iii) The heredity itself is the result of an acquired habit, the 
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Gauls being thus condemned to speak French in a Gaulish manner 
by their habit of having spoken Gaulish earlier. 

The first two arguments are entirely acceptable to modern 
genetics. It is only the third that is untenable. For assuming the 
inheritance of acquired characters, a linguist must not be blamed 
too much : a little Lamarckian superstition has been all the social 
scientist has had in the past to stand between him and the pit of 
racial error. The argument may, however, be inverted. Gaulish 
and French owe their parallel properties to the fact that they have 
been formed by the same people with almost the same heredity. 
Translating Meillet’s views into modern genetics we should have 
to assume an ultimate genetic determination of language but a 
determination which is always imperfect because it operates on 
large and mixed populations over great periods of time during which 
these populations are themselves liable to change. 

As a third example I will give the most distinguished representative 
of the opposite point of view to one I am proposing. Jespersen has 
recognised and defined the cultural relationships of the individual to 
the group. He could not, however, understand their genetic relation- 
ships. He recognised that his individuals differed in importance, being 
** commonplace ” or “ unique,” but he could not recognise that they 
did so for genetic reasons. On the contrary he has explicitly denied 
the genetic, and even the structural foundations of language. Thus 
“the little differences which are of course found in the arching of 
the palate, in the size and form of the nasal chamber, etc., seem to 
have no significance whatever.” And “the view that particular 
features of the pronunciation of Jews, or of Negroes in the Southern 
States of America, are due to racial peculiarities . . . is certainly 
erroneous.” And finally “the conditions governing changes in 
sounds in course of time . . . are everywhere the same.” 

In these ways, putting emphasis for lack of evidence, Jespersen 
resists the smaller cult of the “‘ folk-mind” to embrace the larger 
mysticism of ‘‘ the things common to all mankind.” In doing so he 
puts the issue which I shall contest squarely before us. We have to 
answer one question : Do racial differences affect speech, or changes 
in speech, or have all the peoples of the earth the same potentialities 
and preferences for speech without regard to the different structural 
and mental characteristics inherent in them ? 


5. THE TH SOUND MAP 


The European sound complement varies most clearly, and the 
history of the variation can be traced most accurately, in respect of 
the TH sound. I therefore propose to use it as a pilot test and to 
illustrate methods and principles. 

It will be convenient to group as TH the simple voiced and 
voiceless fricatives, the Saxon 5, the Greek @ and the Basque compound 
8 
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TTH which is transliterated in Spanish as z. While recognising their 
affinity I shall exclude the aspirated dental of Sanskrit and Hindustani 
and the right- or left-tongued LL of Welsh. On this basis the 
distribution of TH in the world to-day, and the chief complement- 
changes that it has undergone in the past 3000 or even 6000 years, 
can be sketched with some certainty. 

The present distribution of TH is strikingly non-fortuitous. Apart 
from an outlier in south-west Africa, the Herrero, it seems to be 
confined to the south and west periphery of the Eurasian continent. 
It exists in three zones—the European, the Arabic, the Dravidian 
with Burmo-Siamese ; or we might say approximately the Christian, 
the Islamic, and the Buddhist. The TH character thus overrides 
the genealogical divisions of language at the same time that it shows 
a certain historio-geographical consistency. 

Our historical and pre-historical knowledge throws light at least 
on the first two of these zones. Common ancestors whether of 
languages or of peoples are no doubt more heterogeneous than we are 
led to suppose. But the common ancestral Indo-European language 
which seems to have arisen in the Danube basin seems also to have 
had the TH sound. Its western expansion brought it into contact with 
a TH-speaking Basque zone stretching, as place-names indicate, from 
Genoa to Bordeaux and Lisbon. In the east it met non-TH speakers 
including the forbears of the modern Finnish, Turkish and Paleo- 
siberian groups. In consequence TH seems already to have changed 
into an aspirated dental in Sanskrit and to have disappeared in 
the whole eastern and central Aryan groups—Slavonic, Georgian, 
Armenian, Persian, Hindustani, etc. The one exception perhaps 
consists in a few isolated tribes of Dardic speakers in the Pamirs, who 
are perhaps the fair-haired Wakhi tribes described by Sir Aurel Stein. 

The great Aryan expansion was followed in historical times, as we 
know, by a series of reverse movements into Europe of non-TH- 
speakers. Although, outside Britain and the Balkans, these migrations 
have not greatly altered the distribution of languages, they must have 
changed the racial composition of the speakers of those languages. 
It is not surprising, therefore, that the TH sound has been lost in a 
band of invaded countries. The change was pre-historic in eastern 
Europe. Elsewhere it can be dated. It began in Italy and moved 
north through France and Germany to Sweden in the period 
from 300 B.c, to A.D. 1400. The Celtic, Gothic, Illyrian, Venetic, 
Etruscan and Basque TH sounds were driven out of all these countries, 
with or without the languages to which they were attached. The 
prestige of a governing class had suppressed the TH sound in educated 
usage in the Latin-speaking part of the Roman Empire. After the 
overthrow of that empire, western Europe was confused phonetically 
as in other respects. There was a new sorting out of sounds. The 
whole of the Carolingian Empire lost its TH. In France there were 
flickerings of TH as seen in the French version of the Serments de 
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Strasbourg in 842. And to-day, according to van Ginneken, we find 
it in Savoie and Dauphiné and, in the Basque combination of TTH, 
in a region of Poitou. Beyond the Rhine TH receded in a wave 
which began in the ninth century in the south of Germany, reached 
the Baltic in the thirteenth, and submerged Friesland and most of 
Scandinavia in the fourteenth century. 

In the mid-European belt TH survives to-day only in the archaic 
spelling of personal and place-names. TH survives, however, in the 
French as well as in the English spellings of such important latinised 
names as Bothnia, Lithuania (Lettow of Chaucer), Ruthenia, Carinthia 
and Carpathian to recall the broad but transient empire of the Goths. 
In German official spelling TH remained until the present century. 
In Friesian, on the other hand, it begins to disappear in documents 
of 1405. 

The fringe in which the TH sound remains to-day runs from Iceland 
through Britain to Spain and Greece. Again we see that it overrides 
the boundaries of linguistic genealogy. It unites Teutonic and Celtic 
in the north-west, Latin and Euskarian in the south-west, Greek and 
Albanian in the south. The phonetic unity makes itself heard through 
the verbal divergence. 

In two areas the changes of TH repay more detailed study. In the 
British area we see a gradient or cline. Its maxima are in Iceland and 
Ireland, recalling an ancient commerce in women. It falls in dialect 
through Scotland and Wales to the north of England where the level 
is irregular but in general lower on the east side. It reaches a 
minimum in England nearest to the non-TH coast of France and 
also where Celtic place-names are fewest. The voiced TH becomes 
D in Sussex and Kent, in the Shetlands, and in the Yorkshire fisher- 
man’s dialect, and partly F or V in Cockney. In western Norway and 
Denmark it is reduced from a phoneme to a consonantal variable and 
indeed is entirely lost in most Fiinen and Jutland dialects. Finally 
in Breton it expires altogether but it has reappeared (in place of r) in 
West Jersey. Historically there has perhaps been some decline in the 
TH of standard English since King Alfred’s time. 

These transitions in space and time reinforce what the general 
map of Europe tells us, namely, that TH is being pushed westwards 
as an effect of migration and without respect to the language, Celtic 
or Teutonic, which carries it. 

The second most important area is that of the Iberian peninsula. 
Climatic and geographical as well as linguistic and historical evidence 
suggests that this region has been especially resistant to racial intrusion. 
The Latin language and later the rule of the Caliphate were successfully 
imposed on the people by numerically weak invaders. These two 
impositions by small governing classes assisted one another in suppress- 
ing for the time being the primitive TH sound which still survived 
in Basque. But in the central mass of Spain from the Asturias to 
Andalusia the racial character of the people, freed from foreign rule, 
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reasserted itself. One hundred and fifty years after the Moors had 
been driven out TH reappeared. It did so in different ways in 
different parts. In Castille, about 1620, c, z, and sometimes d came 
to be pronounced as TH. In Andalusia s underwent the same change. 
The preference was the same, but it expressed itself in different ways 
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Fic. 1.—TH sound map of Europe showing its division into three zones according to 
present frequency or past possession of the @ and § sounds. The dates in the middle 
zone mark the time of loss. (References, Section A). 


by etymological standards just as it does in the comparison of London, 
Manchester and Dublin. 

In this phonetic rebirth the east and the west failed to join. 
Catalan retained its concordance with Provengal and French, and 
Gallego with Portuguese. When, therefore, the New World came to 
be colonised by a mixture of Gallegos, Andalusians, Catalans, and 
even some non-Spaniards, it is not surprising that the Castillian lisp 
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failed to gain control, failed to reach the threshold value for an 
addition to the sound complement. The racial compound in America 
was different from that in the Home Country, and their unity of speech 
has never been recovered. 


Although the TH distribution is peripheral in Europe (and Asia), 
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Fic. 2.—Blood group map of Europe showing the regular gradients and a division into 
three zones by contour lines at arbitrary values of O-gene frequency in the population 
(data from Boyd, 1939, and from the more recent local authorities, Section B). 


there are, outside the TH ring, two vestiges of what we may call an 
anti-TH zone. The first and strongest development is that already 
mentioned in the west of the Peninsula, in Portugal and Galicia 
where a voiced TH occurs only as a consonantal variable in certain 
dialects and where the Castillian ¢ is unknown. The second is in the 
west of Ireland. It may be seen equally in the official Erse and in 
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pockets of dialect from Kerry in the sout’west to Sligo in the nort’- 
west. Its phonetic idiom is imitated in the song of 1688, the Lilliburlero 
(in Percy’s Reliques), thus :—‘‘ And he will cut de Englishmen’s 
troate.” This sentence expresses the opposite of the more characteristic 
east Irish habit of thrampling on the English. It seems to continue the 
anti-TH wave along the Atlantic fringe and may perhaps represent 
the tail-end of an ancient society which, prior to 400 B.c., ran along 
the coast from Africa and derived from the cultural relations that were 
so strongly developed in the Megalithic period. The reappearance of 
the anti-TH zone on the Asiatic fringe in Malaya likewise points to a 
very ancient survival. 

The Arabic region has also something to tell us of the distribution 
of TH. 

At the same time that TH speech was being pushed back in 
Europe a reverse expansion was taking place on the south side of the 
Mediterranean. The Islamic conquests entailed an expansion of the 
Arabic race and the Arabic TH. But this expansion depended on a 
rapid dilution of the conquering polygamous group by the over- 
whelming numbers of their converts and hybrids. This dilution was 
equally marked in its secondary and tertiary stages in the Kingdoms 
of Cordoba in 1490 and of Delhi in 1590. The governing class so 
diluted was far smaller than Arabic enthusiasm imagines or the 
wearing of the green turban suggests. It is not surprising to find, 
therefore, that TH quickly disappeared even in Egypt and survives 
to-day only for academic and sacerdotal purposes in the accents of 
the educated few. 

A third example of TH expansion has likewise proved abortive. 
Classical Hebrew, like classical Arabic, has an important TH sound. 
But the modern Hebrew of Palestine spoken by Ashkenazim Jews has 
(so Mr J. Catford tells me) lost the sound. Here we have two possible 
explanations, and the alternative is of great interest. Is the loss due to 
racial contamination with the non-TH speakers of the bygone Russian 
and Turkish empires? Or is it due to the cultural example of the 
Russian and Turkish speech? The condition is too recent (and 
therefore perhaps too transient) to enable us to say. 

Summing up: the distribution of TH in Europe is intelligible in 
terms of the available evidence from history and pre-history including 
our knowledge of place-names and the inferences of the origin of the 
Indo-European system of languages in central Europe. It favours a 
greater stability of populations than most historians and philologists 
have been inclined to assume. And it suggests that the mass of the 
people ultimately come to express their genetic character by the 
sounds of their language even when a governing class is able to impose 
the word basis. The value of TH in this consideration is merely that 
the European populations are mostly close to the threshold which 
divides its expression from its suppression and are consequently apt to 
change from one to the other under the influence of migration and 
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conquest. It can therefore be used as a marker to indicate gradients 
in the population of the whole continent through its frequency, its 
existence as a phoneme, and its time of origin or extinction. Such 
gradients correspond to the clines which genetics recognises as character- 
istic of the natural geographical populations of polymorphic species. 
We have therefore to discover whether, on independent evidence, 
any other clines can be mapped for man in the region we are 
considering. 


6. THE BLOOD GENE MAP 


The question is now as to whether any other genetic gradients in 
man agree with that of TH. A character is needed with a comparable 
appearance of selective neutrality. When I began my enquiries no 
such characters were sufficiently known in man. Adaptive clines like 
those for pigmentation in Europe are obvious enough. Others, too, 
like that for colour-blindness (Vernon and Straker, 1943), may have 
an adaptive taint. Now, however, in the blood groups we are 
beginning to have what we want. 

We may regard the A, B and O blood group genes as possible 
markers of three homologous segments of chromosome which are near 
neutrality in effect. Since the three are independent, in the sense that 
each has no single alternative, the percentages of the different types of 
chromosome A, B or O in the population of each region of Europe 
give three frequency-maps, showing contours and gradients. For 
small and backward regions such blood group maps are bound to be 
defective as yet, but they are accurate within the necessary one per cent. 
for the large and advanced ones. Amongst them the O group map 
(which has not hitherto been traced) shows the kind of agreement with 
the history of migrations that we are looking for. In general it suggests 
a movement from the east driving O into the periphery. Its contours 
enable us to divide Europe into three O-gene zones :— 


(i) A peripheral zone from Iceland to Greece with 65 to 
75 per cent. 
(ii) An intermediate zone from Sweden to France with 61-5 to 
64°5 per cent. 
(iii) Two zones, a main eastern zone and the Portuguese corner, 
with 61 per cent. or less. 


What relation have these zones with TH distribution? The first 
zone comprises all the TH-positive countries. The second zone 
comprises nearly all those countries which had TH and have now 
lost it. The third zone comprises those countries which have not 
had any recorded pronunciation of TH during the last 2000 or 
3000 years. 

The agreement between blood group and phonetic zones is 
complete except for Italy and Holland, which are 2 or 3 per cent. 
over 65. Their peoples should not therefore have lost their TH as 
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Italy did already at the time when the Latin language replaced the 
Greek, Celtic, Venetic and Etruscan tongues (100-400 B.c.), and as 
Holland did 1000 years later. 

The salient feature of this agreement is perhaps the Portuguese 
depression. But no less significant are those finer details which can 
be detected only in Britain. Here the O cline crosses the island in 
close keeping with the TH position in dialect, reaching its lowest 
level in the south-east. In Europe as a whole the O contours, like 
the TH zones, agree with the distribution of the great language 
families only in so far as these agree with our historic and pre-historic 
knowledge of migrations. 

Thus the O and TH maps agree more closely than the apparently 
confused history of Europe would seem to justify us in predicting. 
The isogens are almost exactly equatable with the isophons. 


7. GENES IN SOCIETY 


Having answered the question whether the genes act on speech 
we may now ask ourselves the quite independent question of how 
they do so. The problem divides itself, apart from the anatomical 
and physiological aspects already discussed, into the strictly genetic 
and the social aspects. 

First let us take the genetic aspect. 

The validity of the genetic component of language is supported 
by these first trials. What philologists have described as the substratum 
effect must be due to the integrated action of the genotypes of popula- 
tions differing in their sound preferences as individuals. Galton, 
Fisher, and most recently Mather, have shown how to detect and 
measure in families and populations the inheritance of properties 
which are not demonstrable in single individuals. Now it seems 
possible to show the inheritance of properties which, owing to the 
slowness of the effects of social integration, are not even demonstrable 
in the contemporary life-times of a multitude of individuals. Yet this 
slow social integration must be the basis of all relations of race with 
culture. In studying human heredity experimental technique has 
once again to be aided by elaborate statistical constructions and the 
novelty of the result depends largely on the method. 

It might have been that language was a purely cultural convention 
like the rules of mating. These rules (although inherently consistent 
within tribes) can have no primarily genetic basis ; but their genetic 
consequences are so great that they must have been exposed to strong 
natural selection. This selection has evidently favoured out-breeding 
since it has left only rare vestiges of inbreeding in Southern Asia. 

Language, it now seems, is in just the opposite position. It has a 
decided genetic basis but hardly any selective value. If the TH 
character had a selective value, owing for example to its suitability 
to a maritime climate, it could hardly show the peculiar distribution 
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we find. In the absence of a selective value we can consider whether 
it is determined by a polygenic balance or by a single gene complex. 
Either could match the blood group distribution and either could 
reflect ancestral origins. Selective neutrality would, however, do 
nothing to favour the development of a gene complex which must 
always be adaptive and, on the other hand, a polygenic basis is more 
likely for what is probably a quantitative character. Just such a 
correlation as is required between a major gene effect in the blood and 
a polygene effect on the tongue has been described in man where 
41°2 per cent. of diabetics are taste-blind and only 25-9 per cent. of 
non-diabetics (Terry and Segall, 1947). 

The social aspect of the problem has been the subject of expert 
study which has the same results whether it is approached with a 
genetic hypothesis or, as Jespersen has done, without one. With the 
genetic situation in mind, however, we can interpret the results with 
more confidence and express them with more precision. Clearly 
the more variation there is amongst individuals (owing to hybridisa- 
tion), the more unstable their language will become. And the more 
unstable a language, the more choice every maturing individual 
will have amongst alternative sounds to use. These sounds will 
differ in their local, their social, and their age classes. In making this 
choice the commonplace or unique individual (as Jespersen puts it), 
or rather the differing genotypes (as I put it), will have different parts 
to play. Further, on my view, the instability of the language will 
arise from a genetic difference between the people who made the 
language and the people who are now using it. 

To put it in another way : my view is an expression of the genetic 
axiom that the properties of individuals are determined by the 
reaction of genotype and environment. But this axiom is modified 
by a special circumstance, namely, that the genotypes of individuals 
themselves interact to produce an environment by an integration in 
space and an accumulation in time. This environment is what we 
call culture. The discontinuities between nations and between 
languages that we see in Europe are the steps cut in continuous genetic 
slopes by discontinuous cultures. 

The combination of methods and assumptions I have used for a 
selected and apparently trivial comparison gives us the means of 
opening a new field of enquiry. It is a field of incomparable richness 
for the study of man. It can provide a co-ordinating hypothesis 
capable of advancing the studies of language, of human genetics, 
of society and of culture in general, both now and before recorded 
history. It can provide a first means of evaluating the modes of 
integration of the masses of dissimilar genotypes which constitute 
classes, races and nations, modes of integration which underlie the 
development of all human culture. In doing these things its discipline 
offers a corrective to the ingenuous theories (or fairy tales) which in 
the past have done less to help science than to embroil nations. And 
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it requires neither the repudiation nor the exaltation of race but assumes 
a continuity underlying the tribal variation of man which more 
superficial methods have overlooked or disowned. 


SUMMARY 


1. The determinacy of development of language has been widely 
recognised. Some linguists have used genetic assumptions to explain 
this determinacy. Others have rejected these assumptions because 
they rested on too little or reached too far. 

2. Modern population genetics, however, enables us to understand 
the relations of genotype to phenotype and of the individual to the 
group or race in a way that removes these objections. 

3. Specific preferences for sound production are the most 
immediately profitable field of study. The TH sound is taken as an 
example. Its present distribution in Europe and Asia and its changes 
during the last 3000 years agree with the evidence of human migrations 
and conquests. The sound map also agrees with the A BO blood 
group frequency contour map in Europe. The Isophons follow the 
Isogens (figs. 1 and 2). 

4. This method of comparative and historical, phonetic and genetic 
study makes it possible to apply rigorous biological conceptions to a 
wide field in the social sciences. 


In considering the present distribution of the TH sound I am indebted to many 
colleagues in all parts of Europe and Asia for information. I am, however, 
particularly indebted to Prof. Menéndez Pidal of Madrid, to Mr S. G. West and 
his colleagues of The British Council in Lisbon, Mr J. Catford, member of the 
International Phonetic Association, Mr David Abercrombie of Leeds University, 
Mr Oswald Harland of Wetherby, Prof. M. J. Sirks of Gréningen University, 
Dr R. R. Race and Dr A. E. Mourant of the Lister Institute for references and advice. 

NorTE.—Since going to press I have received a detailed account of blood groups 
in France (M. H. V. Vallois, 1944, Bull. Soc. Anthrop. 5, 53-80), which indicates 
that a high O gene belt (66 to 68 per cent.) runs across Central France from the 
Basque Country. This zone includes van Ginneken’s TH-speaking areas. 
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ANALYSIS OF DATA FROM ALL POSSIBLE RECIPROCAL 
CROSSES BETWEEN A SET OF PARENTAL LINES 


F. YATES 
Rothamsted Experimental Station 
Received 30.iv.47 


1, INTRODUCTION 


WHEN assessing the value of different parental lines in plant breeding 
work on plants which are normally cross-fertilised, the device of 
making all possible reciprocal crosses and comparing the progeny 
of these crosses is often of value. 

The analysis of the data obtained in such a set of reciprocal crosses 
can be carried out by an adaptation of the analysis of variance 
technique. This adaptation, analogous to those appropriate to the 


TABLE 1 


Fertility (number of seeds per 100 florets pollinated) in reciprocal crosses of 
12 sibs of an F, family of Trifolium hybridum 
2 Parents 





a 


Sib 2 3g y ee I 5 8 9 10 12 4 Total* Mean* 





2 Ir 24 74 10| 167 134 153 144 87 68 320 155] 1228 154 
3 10 24 75 13] 95 178 190 163 84 119 61 158 | 1042 130 
7 9 48 27 331] 235 203 158 192 136 252 204 129] 1449 181 
II 20 30 58 20! 108 170 168 199 119 78 1399 #+%150| 11391 141 





I 95 112 124 122] 21 35 29 20 20 | 105 223 148 929 133 
5 83 123 123 205| 63 50 33 55 21 | 219 152 148] 1053 150 
8 | 80114 163 156| 41. 28 6 27 6: | 212 1g0 = 191 986 141 


3S Parents 


9 73 152 162 210] 58 32 39 42 64 | 248 171 237) 1253 179 
10 70 125 145 89] 61 71 go 69 53 | 177 143 134 883-126 











12 | 175 105 218 165 78 266 204 181 85 te) 3 ie 1600 160 


4 | 192 116 202 163 150 195 157 154 133 ° 2 | 155} 1617 162 








6 | 120 121 155 178 154 202 163 156 110 170 239 | o| 1768 161 











Total * | 888 968 1292 1288 987 1348 1193 1129 754 1642 1782 1668 | 14939 
Mean* | 111 121 162 161 141 193 170 161 108 164 178 152 152 














* Excluding incompatible crosses. 


more complicated forms of experimental design, such as incomplete 
randomised blocks, was developed to deal with the set of data shown 
in table 1, which gives the fertility in reciprocal crosses of 12 sibs 
287 
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of a certain F, family of Trifolium hybridum (Alsike). These data 
were obtained in the course of work at the Welsh Plant Breeding 
Station, and my thanks are due to Mr Watkin Williams for permission 
to use them to illustrate the method of analysis. 

This particular set of crosses is complicated by the fact that, in 
addition to nearly complete self-sterility, certain groups of sibs (2, 3, 
7, 11), (1, 5, 8, 9, 10), and (12, 4) are incompatible. This in- 
compatibility is clearly shown by the data. 

Before discussing the analysis of these results, we will consider the 
simpler case in which partial or complete self-sterility exists but in 
which there are no incompatible groups, and also the case in which 
the selfed plants behave similarly to the crosses. 


2. SELF-STERILITY WITHOUT GROUP INCOMPATIBILITY 
If the plants are self-sterile, but all crosses are fertile, a table of 
results of the type given in table 2 will be obtained. 


TABLE 2 
Data from reciprocal crosses when there is self-sterility 











Male Female parent | 

parent . @ ear Total 
1 = Ja: Ju + Y;. 
2 :. we. ae. Wee Y2. 
3 Js Joa — Jae + Ys. 
4 Ja Jaa Das — + Yq. 
‘ie. | Na’ Yar Vg. Bax ae wi 

















The »’s in this table may represent measurements on single plants, 
or the means or totals of measurements on a number of plants, possibly 
derived from a number of replicated plots. The Y’s denote the 
marginal totals of the »’s. 

It is apparent that the direct comparison of Y,., Y3., . . . will not 
give valid estimates of the differences between the male parents, since 
one female parent is missing from each total. Y,., for example, does 
not contain progeny from female parent 1. 

Efficient estimates can be obtained by the method of least squares, 
fitting constants for the effects of the male and female parents. (See, 
for example, Yates (1933).) Let there be & parents of each sex, and 
let the constants be : 


Mean: m. 
Male parents : a, dg, gy. « « S(a) = 0. 
Female parents : 5,, bs, b3, . . . S(b) = 0. 


We then have 


Jie = m-+-a,+bo+ €49, etc., 
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where €,2, etc., are residuals the sum of whose squares is to be 
minimised. 

The least square equations are as follows :— 








k(k—1)m+-(k—1) (a,+ag+ .. .)+(k—1) (6, +0.+ .. .)=Y.. 
(k—1) (m+-a,) +4,+53+ eh ; =Y;. 
(kK—1) (m+5,)+a,+a,+ ree =Y., 
From these equations, using the identities S(a) =o, S(b) =o, we have : 
m=y 
_ (k—1) Y;.+-Y.,—Y.. 
= Kk—2) etc. 
_ (k—1) Y..+Y,.—Y.. 
4 = k(k—2) etc. 


The analysis of variance is shown in table 3, dev*y being used to 
denote the sum of squares of deviations of all the y’s from their mean. 


TABLE 3 
Analysis of variance of the data of table 2 





oes Sum of squares 





Differences between parents (male | 2(k—1) | a,¥,.-+agYq. +... +b;Y.y+5gY.+... 
and female) 
Remainder . : ‘ . . | R—gk+1 By difference 


Total . , ‘ . | k(kK—1)—1 dev*y 




















This form of analysis is based on the assumption that the effects 
of the parents, male and female independently, are additive, and 
that any departures from this additive law are random and 
independent. If the y’s are themselves each derived from a number 
of replicated plots the remainder term may be compared with the 
experimental error obtained from an ordinary analysis of these 
replicates. This comparison will indicate whether there is any 
departure from the additive law. 

The above analysis may be extended in two ways. In the first 
place, since the effects of the male and female parents of the same 
line are likely to be similar, we may recast the analysis so as to 
estimate the average effects of a parent of a particular line (male or 
female), and the differential effects of the male and female parents of 
that line. There may, for example, be evidence that the progeny of 
line 1 are superior to those of other lines, but no evidence that this 
superiority is more marked for the male or female parent of this line. 
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In the second place, there may well be some similarity between 
both reciprocal crosses of a pair of parents, either due to some degree 
of incompatibility or to some specially favourable conjunction of genes. 
In this event the »’s which are diagonally opposite one another, 
Jig and 79, etc., will be more closely correlated than the »’s not so 
related. This point may be investigated by subdividing the remainder 
degrees of freedom in the above analysis into those derived from 
contrasts of the sums of pairs of diagonally opposite values, and those 
derived from contrasts of the differences of such pairs. 

Since the estimates of the combined effects of male and female 
parents of each line are derived solely from the sums of diagonal 
pairs, and the differences between male and female parents solely 
from the differences of diagonal pairs, the whole analysis splits into 
two parts, one based on the sums, and the other on the differences. 

The sums and differences of the diagonal pairs can be set out in 
two tables of the form given in table 4. 


TABLE 4 
Sums and differences of diagonal pairs of the data of table 2 














(a) Diagonal sums, 92+, etc. (6) Diagonal differences, y,.—y9;, etc. 
Male Female parent Male Female parent 
parent} 1 2 $ 4 «w Total parent Se 4 ww Pee 
I — Ue Ws Uy + U, I = Bete Se ee Vi 
2 — Ugg Ugg «-- U, 2 —— Ugg Ugg oo Vs; 
3 — Ugg «+ Us; 3 — U4 +s Vs 
4 = eee U, 4 —_/ eee Vi, 
Total S (U) Total o 


























The totals U,, etc., shown in the table of diagonal sums are 
obtained by summing both the row and column of the parent 
concerned. Thus Us=u,3+uo3+uggt+ ... The totals Vj, etc., in 
the table of diagonal differences are obtained similarly, except that 
the values in the column concerned are subtracted. Thus V,= 
—U13—Vogt%gat ... Wesee that U,=Y;.+Y.,, etc., V,=Y,.—Y., 
etc., and S(U) =2Y.. 

As before, the effects of the parents may be represented by 
constants, whose values can be estimated by the method of least 
squares. The two tables can be treated independently, thus providing 
separate estimates of the residual variances. It will be most convenient 
to use constants which represent the values of table 4 directly. 
Let these constants be : 

Table 4 (a): Mean: m’ 
Parents : ¢,, Cg, C3. - « S(c) =o 
Table 4 (5) : Differences (male-female) : d,, d., d3, . . . S(d)=o 














oo; fe oeel™ 
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Thus 


Uyg=m' +6, +6e+e'r9 
y2=d,—dy+e" 19 
The least square equations are : 


(k—1)m'+(k—-1) (Gq +ea+ « « .)=4S(U) 


(K—1) (m’+¢)+egtes+ ... =U, 
isd. sae 
Whence : . : ‘ 
m' = 29 
__kU,—2Y.. 
4=7V, etc. 


Thus, as we should expect, c,=a,+5, and d,=a,—J,, etc. 

It will be noted that the difference between the effects of different 
male parents is given by the differences of the quantities }(c+d), 
and of female parents by the differences of $(c—d). If the differences 
between male and female parents can be neglected the parental effects 
(male or female) are measured by the quantities $c. Thus, under these 
circumstances, the expected difference between the cross with parents 
1 and 2, and the cross with parents 1 and 3, is $(¢.—cs) ; that between 
the crosses with parents 1 and 2, and with parents 3 and 4, is 
4(¢,+¢2) —43(¢g+c4). 

The analysis of variance now splits into two parts, as shown in 
table 5. The extra factor of 4 has been introduced into the sums of 
squares to bring them to units of a single entry of table 2. 























TABLE 5 
Analysis of variance of sums and differences of diagonal pairs 
Degrees of freedom | Sum of squares 
Parents k—1 saral dev?U 
peg Remainder | }(k—2) (k—1)—1 | By difference 
Total .| $k(k—1)—1 dev’ 
{ Parents k-1 ok S(V*) 
difference (8) Remainder | $(k—2) (k—1) By difference 
big . «| $(k—2) $S(v%) 








The contrast between the mean squares for remainder (a) and 
remainder (b) will indicate whether there is any significantly greater 
= 
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similarity between reciprocal crosses of the same parents than between 
crosses of different parents, after allowance for the average effects of 
parents, male and female independently, has been made. In order to 
test if there are any differences between parents which are consistent 
for all crosses the mean square for parents (a) should be compared 
with remainder (a). Comparison between parents (6) and remainder 
(6) gives the similar test for consistency of differences between male 
and female parents. If an estimate of experimental error is available 
the remainder mean squares may be compared with the error mean 
square. 
The standard errors of the c’s and d’s are given by the formule 
V()= 


(k—2) = (207) 


V(a)=4 (208 


the appropriate estimates of of and o§ being obtained from the analysis 
of variance, and the factor 2 being introduced because each value of 
table 4 is the sum or difference of two values in table 2. In general 
the mean square for remainder (a) will provide the appropriate estimate 
for o, and that for remainder (4) for 0%, though if experimental errors 
only are under consideration the mean square for experimental error, 
if available, will be appropriate. 


3. NO SELF-STERILITY 


In the absence of self-sterility the data can be arranged in a k xk 
table similar to table 2, but with the diagonal cells occupied. The 
analysis of variance of this table presents no difficulties, since the two 
sets of marginal totals are orthogonal, and provide estimates of the 
differences between parents, male and female independently. The 
corresponding totals may be added to provide estimates of the 
differences between parents averaged over both sexes, and subtracted 
to provide estimates of the differential effects of the two sexes. 

The possibility of the existence of greater similarity between 
reciprocal crosses can best be tested by the procedure of section 2, 
omitting the progeny of selfed parents. The inclusion of the selfed 
matings in this analysis introduces additional complications, and 
would add little to the information provided by the partial analysis. 


4. SELF-STERILITY WITH MUTUALLY INCOMPATIBLE GROUPS 


If certain groups of parents are mutually incompatible the general 
procedure of section 2 can be followed, though the solution is somewhat 
more complicated, owing to the fact that all comparisons will not be 
of the same accuracy. The least square equations, however, are 
still directly solvable. 
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Let the k parents be divided into groups of p, g, 7, . . . mutually 
incompatible parents, so that p+q+r+ ... =k, and denote the 
constants, etc., of the first group of parents by the suffixes p1, p2, . . . 


with the convention that ¢,,+¢,,+ . . . =S(c,), ete. 
The least square equations are : 


Diagonal sums 
(U) 


B{p(k—p) +9(k—g) +... }m'+(k—p)S(cy) +(k—g)S(c,)-+...=48 
Group p icant teal te a) +S(¢,) +... =U 
(p equations) , P ; ‘ 
Group q alley chs i da =U,n 
(q equations) , , P ‘ j 


Diagonal differences 


Group p ili: i —S(d,) —... =Vir 
(p equations) : ‘ ‘ : 
Group 4 bai —q)d,,—S(d, —S(d,)—... =Vaa 
(q¢ equations) i ; 


The ¢ and d equations may be rewritten 
(=P) (m+ ep) S16) Up} 

(- g) (m'+¢_1) —S(¢q) =U ga 

“(k—P)dyn +8(d,) =Vpa 

‘(kd +S(d,)=Vor 


wre 





Summing the equations of each group, we have : 


p(k—p)m'+(k—2p)S(c,) =S(U,) 








q(k “— +(—29)8() =S(U,) 

“kS(d,) =S(V,) 

kS(d,) =S(V,) 

Hence ; 
b(k—P) 
S()= pt, stu) — AEP 
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Since S(c,) +S(c,) +... = 0 we have 


I 
Fog SUP) + 


I 


nies ew 


S(U,)+ «.. 


where 


_ plk—p) , a(k—9) 
= k—2p + k—2q ae ae 





It may be noted that since p+q¢+ ... =k 





= p q 
nies a aera rr i --| 


The above equation gives the numerical value of m’ (which will 
not in this case be exactly equal to twice the mean of the original 
values). By substitution for S(c,) and S(d,) in equations (A) we also 
find 


*~_{S(U,) —(k—p)*m’} 


(k—f) ep = Unt k—2p 





(k—P)dn. = Vin— 7 S(V>) 


It will be noted that the first of the original least square equations 
has not been used. This equation is redundant in virtue of the identity 
S(c) = 0. 

The degrees of freedom and sums of squares in the analysis of 
variance are given in table 6, in which N = 4{p(k—p)+9(k—g) +...}, 
the number of entries in the table of diagonal sums and differences. 








TABLE 6 
Analysis of variance with self-sterility and mutually incompatible groups 
Degrees 
of Sum of squares 
freedom 
, “- S(U)}? 
mb Parents k-1 #{4m’S(U) +¢,,U,:+... +tyglat ja 
Remainder| N—k By difference 
Total N—-1 ddev*u 
Diagonal (Parents k—1 (dy Vort..- +a Vat.) 
differences (5) 
Remainder} N—k+1 By difference 
Total. . N $S(v?) 
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It will be noted that, since m’ is not equal to the mean, it must be 
introduced explicitly in the sum of squares for the constants, and the 
ordinary correction for the mean deducted. 

The formule for the standard errors of the differences of the c’s 
and d’s may be derived from the solution of auxiliary sets of equations 
in the manner followed in partial regression analysis. There are 
certain points of difference, however, which are of general recurrence 
in least square solutions of this type, and which will therefore be worth 
describing. For this description we will use the notation customarily 
adopted in regression analysis (with the exception that, to avoid 
confusion, the c’s of this notation will be replaced by c”s). In this 
notation the regression coefficients 5,, bg, ..., satisfy the equations 


b,S (xf) +5,S(x x2) + ... = S(x 7) 
bS(xy%q) +59S(x3) + ... = S(x2y) 


C11) C19) C’yg) --- Satisfy a similar set of equations with the numerical 
terms S(*7), S(x2)); S(x37), -«e Feplaced by 1, 0, 0, ... , ¢’ 915 C’ ae, C'es 
a similar set with numerical terms 0, 1, 0, ... , etc., and ¢’;_. = c's, etc. 
] 
- V (by) = c'n0?, V(bs) = c'as0%, et, 
cov(b,),) = c’4,0°, etc., 
so that 
V(b: —ba) = (611 +6" 22—26'2) 0”. 
In regression analysis the numerical solution of the equations for 


the b’s is usually effected by first determining the matrix of ¢”s, and 
thence the 4’s from the relations 


by = 6S.) +6128 (%29) + ... 


In cases such as the present, in which the equations are such that a 
simple algebraic solution is possible, the procedure may be reversed, 
and the c’”’s determined from this solution. 

If there are redundant regression coefficients or constants such 


that 
“ Mbit HabetHabst... = He 
Bb +H' sbeth'sbst--. = B'o 


it has been shown (Yates and Hale, 1939) that the numerical terms 
I, 0, 0, ... of the set of equations for c’3,, c’y9, ¢’y3, . . . must be 
replaced by 


inp dep di... <del «4 —glengtie.. ..; 


where d, da’, .. . are so chosen that the ielations to which the 
redundant constants give rise between the coefficients of the normal 
equations still persist. 
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In the present case we have, in the solution for the diagonal sums, 
Cyrtepat eee tog teget eee +e tegt+ o. =O 


and the p’s for all the constants except m are therefore unity. Also 
the sum of the numerical terms of all the equations except the first, 
k in number, is equal to twice the numerical term of the first equation. 
Consequently the numerical terms for the set of equations giving 
C pasprs CtC., are 
A hee nee eee 
> k > k > k e, pe 


Hence the values for c’,;.,:, etc., are given by the substitutions 
I 


I 
Un = et Us2 = Uys = eee =U, =... = " e 


Substituting these values in the solutions already obtained and 
denoting the resultant value of m’ by m',,, we find 


See he 








a IK k—op k—2q k—2r 
BG Me on, I 
KK k—2p 
his 
enon Blt d+ eight -ore] 
k— 
P ‘oe I P 
(aan pal Pt g -t ow 





The other c”’s can be written down immediately from these expressions 
by symmetry, and it can be verified that ¢’y) «9, = ¢'g1 - p1- 
We now have 


V (Cpr —ep2) sn (C'p . pi tere . p2—2C'py . p2)o” 


2 
i> (207) 
V(6p1—Cq1) = (Cp : pite'g *gl —C' ps *ql —Ca : pu) 3 
I 


pn fare 
=(0) {p+ p+ 





i I 
(hp) (F—2p) * Ea F=29) | 
_ 2k(p—q)? 
K(E—2p)"(k—29)* 


where, as before, estimates of of are given by the remainder mean 
squares of the analysis of variance (table 6). These two expressions 
give the variances of the difference of any pair of parents in the same 
or in different groups. 
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The same procedure may be followed for the diagonal differences. 
It will be found that 


Vdd) = p=, (20%) 


V(dy—da) = (: os Mes te eed 


It can be verified that when p= gq=1 the expressions for 
V(6p1—g1) and V(d,,—d,,) reduce to the expressions already found 
for the case in which there is no group incompatibility. 


5. ANALYSIS OF NUMERICAL DATA 


We may now consider the analysis of the data of table 1. In this 
case k= 12, p= 4, g=5, f= 2, 5 =1, the parents in the four 
groups being sibs (2, 3, 7, 11), (1, 5, 8, 9, 10), (12, 4), and (6) 
respectively. 

The sums and differences of the diagonal pairs are shown in 
table 7a and table 7), the incompatible crosses being omitted. The 
U and V totals, and S(U,), and S(V,), etc., are also shown in these 


tables. 
TABLE 7a 


Diagonal sums of data of table 1 























Q Parent 
d Parent I 5 8 9 10 I2 4 6 U S(U) 
2 262 217 233 217 #157 243 512 275 2116 
3 207 301 304 315 209 218 177 279 2010 
9286 
7 359 326 g21 294 281 470 406 284 2741 
II 230 375 324 409 208 243 302 328 2419 
I 183 373 302 1916 
5 485 347 350 2401 
8 416 287 294 2179 (10515 
9 429 325 393 2382 
10 262 276 244 1637 
12 293 3242 
6641 
394 3399 
3436 3436 
29878 
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TABLE 75 
Diagonal differences of data of table 1 
i 

é Parent | , ah ae , Bey we 6 Vv sv) 

2 +72 +51 +739 +71 +17. —107 +128 +35 | +340 
3 —17 +55 +76 +11 —4: +8 —55 +37 | +74 ial 

7 +111 +8 —5 -—-30 -—-9 +34 +2 —26 | +157 

II —14 —35 +12 —11 +30 -—87 —24 —28 | —157/ 

I +27 +73 —6 —58 

5 —47 —-48 54 | 295 
8 +8 —27 —g2 | —207 -—307 

9 +67 +17 +81 | +124 

10 +92 +10 +24 | +129 
i — ahs 

4 —84 | —165 
+100 +100 
o 

















The equations for K, m’, the c’s and the d’s are as follows :— 
K = 6{14 445454 4} = 291 
se a tf @ 


~~ weap 9786+ : 10515+ 5 6641+ 3436 = 3007809 


oy = s[2116+ *{ 9286-8" x 300-7809} | = 52116 —311'3743= — 46-8743 


ope = 3 2010—-311-3743 = —60-1243 


fa = +] 1916+ *{ 1051578 300-7809} | = 1916—301 -6617= —27°9474 


1 
7 
d, = fae. — (+414) | = +38:1875 


dq = :| —58— ~ (—307) = —4'6310 
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The complete set of values for the c’s and the a’s are given in table 8. 











TABLE 8 
Values of c’s and d’s 
Group Parent ¢ d 

( 2 — 46-8743 +38°1875 
p 3 —60°1243 +4'9375 
\ 7 +31°2507 +15°3125 
11 —8-9993 —23°9375 
I —27°9474 —4°6310 
5 Fargas | —g6-4B8r 
q 8 +9" 240 —25°9167 
9 +3 6240 +21°3690 
10 —67°8046 +22°0833 
12 +31°2364 —2°4750 
. { 4 +46-9364 —14°7750 
$ 6 +12°7410 +8-3333 
-++0-0009 —0:0002 























The sums of squares for parents in the analysis of variance are 
calculated as follows :— 


S(cU) = —46-8743 x 2116+ ... = +196119 
$m’'S(U) = “=X 300*7809 x 29878 = +4493366 


I 2 I . 

— + {su }'= — -_ xa9878* = — 66 
NI (U) a 987 45545 

134919 

S(dV) = (+38-1875) x(+340)+...= 45372 


The sum of squares of deviations of the values in the body 
of table 7a, dev’u, is 319,721, and the sum of squares of the values of 
table 7b, S(v?), is 140,559. Introducing the further factor of 4 we 
obtain the analysis of variance shown in table 9. 

The main features of the data are now apparent. The mean square 
for sibs (a) is significantly greater than the remainder (a) at the 
5 per cent. level, and there is also some indication of a difference 
between sibs (4) and remainder (4), though this is not fully significant. 
Furthermore, remainder (a) is significantly greater than remainder 
(b) at the 5 per cent. level (e** = 1-99, 5 per cent. value = 1-70). 
There is thus clear indication of differences between sibs, which have 
a certain degree of consistency over all compatible matings. There 
is some indication of differences in effect between pollen and ova of 
the same sib, though these differences, if they exist, are not large. 
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Finally, crosses between individual pairs of sibs deviate somewhat 
from the additive law based on the average performance of either sib 
separately in all compatible crosses. 

TABLE 9 
Analysis of variance of data of table z 

















Degrees of | Sum of Mean 

freedom squares square 

( Sibs . II 67460 6133 

Diagonal sums (a) J Remainder : 37 92400 2497 
Total . " 48 159860 

Sibs : II 22686 2062 

Diagonal differences (6) | Remainder. 38 47594 1252 
Total . . | 49 70280 

















The final results may be presented in a table of values of $(¢+-d), 
3(c—d), 4c and d, giving respectively the estimates of the average 
effects of the male parents, the female parents, the mean of male 
and female parents, and the difference of male and female parents. 
The standard errors of individual comparisons may be calculated, 
if required, from the formule already given. Allowing for irregularities 
in the performance of the different crosses, i.e. basing the error variance 
on remainder (a) of table 9, the smallest of the standard errors for ¢ 
comparisons, that between sib 12 and sib 4, for example, is given by 


V(64—¢2) = = (2497 x2) = 31-62 


Similarly the largest of these standard errors, that between any 
member of group and any member of group g, is given by 


Vena) = |g tite t 1 _ 2x12(8—7)? 


8x4 7X2 29° a (2497 X 2) = 42°27 


The smallest and largest standard errors for the d comparisons are 
similarly given by 


wihsh wicca hae = 20:92 
V(d,.—d,1) u(= > +) X 1252 X2 = 20°9 
V(di—dy2) = “x 1252 X2 = 26-72 


Since the ¢ and d comparisons are independent the standard errors of 
the comparisons of 4(c+d) and }$(c—d) can be obtained from the 
above standard errors by the ordinary rules for the combination of 
variances of independent variates. 
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For many purposes average standard errors which are approxi- 
mately applicable to all comparisons will be adequate. These can be 
calculated directly from the mean square deviations of the values in 
table 8 and the corresponding variance ratios in table 9. The 
variance ratio for the ¢ comparisons is 6133/2497 = 2-46, and the 
mean square deviation of the ¢ values is 18883/11 = 1717. The 
average standard error of a single ¢ value is therefore approximately 
/1717/2°46 = +26-4. The average standard error of the difference 
of two values is therefore 4/2 times this or +37°3. Similarly the 
average standard error of a single d value is 4/518/1-65 = + 17-7 and 
of the difference of two values is +25:0. 

Examining the results of table 8 in detail, we see that five sibs, 
7, 5, 9, 12 and 4, give decidedly better performance than average. 
The crosses between these sibs have picked out a good proportion 
of the high values of table 7a, but there are some remaining, in 
particular those between 4 and 2, between 4 and 6, between g and 
11, between g and 6, and between 12 and 8 of which one parent is a 
sib not in this group. Again, some of the crosses between the sibs of 
the top group, in particular those between 7 and 5, 7 and 9, and 
g and 4, have given values decidedly below expectation. These 
inconsistencies are, in part at least, a reflection of departures from the 
additive law. If it is desired to pick pairs of parents which may be 
expected to give high fertility, we should in these circumstances give 
some weight to the performance of the individual crosses as well as 
the performance of either parent separately. We should also perform 
the cross in the direction which the d value indicates is most favourable, 
é.g. in a mating between 5 and 12 we should use sib 5 as the female 
parent. 


6. SUMMARY 


The paper describes the analysis of data obtained in plant breeding 
work when all possible reciprocal crosses between different lines are 
made. The cases discussed are: self-sterility, no self-sterility, self- 
sterility with incompatibility within groups of lines. The last case is 
illustrated by a numerical example. 
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l. INTRODUCTION 


A NUMBER of results are now available from systematic experiments 
on natural cross-pollination between varieties of crops grown for seed, 
e.g. on radish by Crane and Mather (1943) ; on radish and turnip, 
using various planting arrangements by Bateman (1947a) ; and on the 
wind-pollinated crops, beet and maize, by Bateman (1947). These 
data are in a consistent form which makes comparison and generalisa- 
tion possible. 

The conclusion that may be drawn from these experiments is that 
whatever the absolute level of contamination or the range of distance 
involved in any experiment, the shape of the curve relating con- 
tamination to isolation distance is the same. It is such that the rate 
of decrease of contamination per unit increase of isolation distance 
itself decreases with that increase. At first, increases in isolation 
distance rapidly reduce contamination, but at greater distances the 
contamination, though small, becomes persistent and in one instance 
(Bateman, 1947a) there was no detectable decrease in contamination 
when the isolation distance was increased from 50 yards to 200 yards. 

The consistency of these results justifies the present attempt to 
derive a general mathematical expression which, by the substitution 
of appropriate values for the constants, could express the variation of 
contamination with distance under any conditions and for any crop. 

In the following sections (the first of which contains new observa- 
tions on the pollinating methods of bees) the argument is roughly 
divisible into three parts :— 


(i) The discussion of the causation of the particular phenomena 
from which are derived one or more possible fundamental equations. 
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(ii) The choice of suitable simple equations which give «an 
approximation to the results anticipated from the fundamental 
equations. 

(iii) The testing of these empirical equations against experimental 
data. 

To the pure mathematician this may be no more satisfactory as a 
method than a purely empirical one. There are, however, many 
possible empirical formule to fit a given set of data. Some of these 
would immediately prove unsuitable when applied to another set 
of data using a different range of factors. There would still be a 
large residue whose unsuitability would only slowly be exposed by 
accumulating evidence. The method herein adopted does, if the 
fundamental equation is satisfactory, enable one to make at the start 
a good choice of empirical equations, which must pass the severe test 
of agreement with the fundamental equation. 

Since an attempt is made to discuss each phenomenon according 
to its causation, insect-pollination and wind-pollination have to be 
considered apart at first and, only at the end, taken together. 

For the sake of consistency the same symbols have been used 
throughout for the same variables. This has meant transposing the 
formule of other authors. 

The main symbols used are as follows :— 





F = the proportion of contamination 
D = the distance 

x = the power of D 

n = the insect density 

p = the pollen or spore density 


2. BEHAVIOUR OF POLLINATING INSECTS 


In insect-pollinated crops the effect of distance on cross-pollination 
is intimately connected with the behaviour of the pollinating insects. 
The connection is simplest in self-incompatible crops since with them 
all the functioning pollen has to be brought by insects. The proportion 
of contamination in the seed will then be equal to that in the available 
pollen on the insect visitor. There will of course be pollen in the 
baskets on the bee’s legs and on some other parts of its body which 
will not be available for pollination. In self-compatible crops the 
contamination of the seed will be diluted to a varying extent according 
to the amount of automatic self-pollination in the particular crop. 

The proportion of contaminant pollen on an insect must depend 
on the number of visits made after it has left the contaminating variety. 
The effect of distance on this depends on two factors: the flight 
habits of pollinating insects when at work, and (more difficult to 
observe) the rate of replacement per visit of contaminant by non- 
contaminant pollen. 














CONTAMINATION IN SEED CROPS 


Observations 


The first feature observed about the flight of a pollinating insect 
is that there are two distinct kinds of flight. This is most noticeable 
in bees, which are the most important pollinators. Most of the insects’ 
movement takes the form of slow flights which are easy for the observer 
to follow, and during which the insect hardly moves above the level 
of the flowers of the crop being visited. But every now and then 
an insect will soar up suddenly from the crop and disappear from 
sight. In hive bees these soaring flights probably mean the end of the 
forage and a return to the hive. By analogy, with bumble bees and 
solitary bees (such as Andrena), these flights would mean a return to 
the nest. In hover flies there is more doubt as to whether the two 
kinds of flight have distinct functions, but these insects are relatively 
unimportant in pollinating most crops. 

In general, it would seem that pollen carried forward from one 
forage to the next is not likely to be responsible for much of the seed 
set because a single forage includes a large number of flights ; and 
although there does not appear to be any evidence as yet on the 
amount of pollen carried forward from one forage to the next, after 
the bee has cleaned itself in the hive, this carry forward is unlikely to 
be great. Most of the effective pollen will be collected and deposited 
during a single forage. 

If information is to be obtained which will assist in understanding 
the process of contamination it is therefore the activity of the bee 
during the forage which needs most attention. 

For this purpose observations were made on hive bees, solitary bees 
and hover flies visiting a turnip plot in bloom. The plants were 
spaced 6 in. apart each way. Individual insects appeared to move at 
random over the plot as judged by the paths traced for single insects 
during a forage. There was apparent disorderliness, the same plant 
often being revisited at irregular intervals. 

Other observations made on bumble bees on a radish plot enable 
us to make a statistical test of randomness. Here the path of each bee 
was projected on to a line at right angles to the rows. Each flight was 
recorded in terms of its component parallel to this line. When bees 
did not cross from one row to the other the flight length was recorded 
as zero. Other flights were recorded as so many rows to the left or 
right. Three forages were analysed with respect to the direction of 
consecutive directed flights, 7.e. those whose direction was recorded. 
The results are summarised on the following page. 

Like follows like in 115 instances out of a total of 199. If the 
flights were completely random the expectation would be 99°5. There 
is therefore an excess of cases where consecutive directed flights were 
in the same direction. x? = 4:829 for one degree of freedom with a 
probability of 0-o2. In none of the three forages was there an excess 
of flights in one direction over the other. This means that, though 
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over longer periods the bees did not appear to move in any particular 
direction, over shorter periods the bees tended to have an over-all 














Subsequent flight 
Left Right 
First Left 55 42 
flight Right 42 60 














direction first one way and then the other. The deviation from 
complete randomness, however, though statistically significant, is 
not very great. 

The flight lengths of insects visiting the turnip plot were recorded 
in the manner illustrated in fig. 1. As the plants were 6 in. apart 









































Fic. 1.—The method of recording the flight lengths of pollinating insects in two dimensions, 
For explanation see text. 


each way the unit of distance was taken as 6 in. Any plant just 
visited is taken as being at position o in fig. 1. The next plant 
visited can be taken as being on a square with plant o as the 
centre. The figures 1, 2, 3 and 4 denote progressively larger con- 
centric squares whose sides are distances of 1, 2, 3 and 4 respectively 
from the centre. If the insect lands on a plant on square 2 the flight 
will be recorded as of length 2. Fig. 2 shows the distribution of 
flight lengths on turnip, of hive bees, solitary bees and hover flies. 
The distributions in the graphs appear similar for all the species. 
No record was kept of the length of the forages, but it can be stated 
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that hive bees foraged longest of the three insect types, and hover 
flies shortest. 
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Fic. 2.—The frequency distribution of the two dimensional flight lengths of various insects 
on turnip. D is expressed in units of 6 inches. 


Bumble bees do not visit turnip, possibly because the slender pedicels 
will not take their weight, but it would appear from observations on 
the radish that bumble bees forage longer than hive bees. 


Closeness of foraging 


The closeness with which insects forage is reflected in the distribu- 
tion of the points which they reach by single flights, about the plant 
from which the flights start. The closer the foraging, the narrower 
this frequency distribution will be ; and the wider the foraging, the 

U 











308 A. J. BATEMAN 
wider the distribution. The variance of the distribution can be taken, 
therefore, as a measure of foraging behaviour. An adjustment must, 
however, be made to allow for the two dimensional nature of flights, 
because the variance is measured relative to one dimension only. 

This is done by dividing the frequency of flights, 7, by a number 
proportional to the area of the zone into which the flight takes the 
insect. In Fig. 1 the boundaries of the zones are shown by dotted 
lines. The area of zone 4 is the area of the square enclosed by the 
outer zone boundary minus the area enclosed by the inner zone 
boundary. This is 9?—7?=32. In general the area of a zone is 
given by (2D+1)?—(2D—1)?=8D where D is the number of the 
zone (i.e. the distance of the plant from plant 0). This formula applies 
for all zones except zone 0 whose area is 1. If for all the other zones 
we use D as the correction factor for area, then the factor for zone 
o is 4. The corrected value for flight distribution is termed n which 
is the density of insects round a source after one flight. The value 
of n for D =o is halved for estimation of variance, since the other 
values of D represent only half the distribution. 

The bumble bee data from the radish plot with plants one foot 
apart gives the distribution in one dimension and therefore needs no 
correction. The observed frequencies give directly the distribution of n. 

The mean squares for the distribution of insects after one flight are 
(nD?) 

a 


calculated as Except in the bumble bee data, there is no 





means of estimating the mean, which is therefore taken as zero. D 
is therefore the deviation from the mean and no correction for the 
mean is required. As the mean is assumed, no degree of freedom is 
utilised in its estimation and the number of degrees of freedom is Zn. 

In 1943 the frequency of flights within zone D=0 was not recorded. 
The “‘ mean square” calculated from these observations is thus not a 
true mean square but is inflated by the omission of the class closest 
to the assumed mean. A comparison of these false mean squares will 
show, however, any differences in the degree of dispersal of the flights. 

Table 1 shows the values for 7 and n for six sets of observations 
with the calculations of the mean squares, true and false. The first 
three columns are strictly comparable, being based on the same 
plots at the same time. The next two differ from the first three only 
in pertaining to the following year. The true mean square for the 
bumble bee is six times that of the insects in columns 4. and 5. Here 
not only the years were different but also the crop, the spacing, and 
the method of recording, so little significance can be attached to the 
comparison. 

The first five sets of observations suggest that if all cross-pollination 
is carried out during forages and none between forages, the dispersal 
of contaminating pollen on a given crop will be independent of the 
type of visitor. It is possible, however, that some pollen is carried 
forward from one forage to the next. If at the same time one forage 
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of a visitor does not commence where its previous forage finished, 
conditions will be such that the shorter the forage the more wide- 


TABLE 1 
The dispersal of insect flights 


D=Fiight length. 

T=Frequency of flights according to length in two dimensions. 

n==Frequency of flights according to length in one dimension. 

The mean squares have been corrected for grouping. 

The frequencies actually recorded are in italic type: derived frequencies are in ordinary 





























type. 
: Solitary : : Solitary | Bumble 
Hive bees “ Hover flies | Hive bees See fos 
1943 1943 1943 1944 1944 1945 
D | y es me Y aie Sa e Pre n 
° wes ui | cop nest Peat’ | cee | SG SQASCOTam: open | “a6 
I 238 238-00| 52 52°00|2r 21°00\ 54 5§54°00| 27. 21°00! ... 
2 118 =659°00| 90 15°00\14 70O|2r 10°50|14 7:00| 169 
3 56 1867)10 3°33) 4 1°38) 5 1°67) 4 1°93) + 
4 16 4°00| 5 1°25] Fr 0°25] 2 0°50| © 0'25| 20 
5 12 2:40| 5 1°00 0:00| 2 0°40| © 0-20] ... 
6 5 0°83 0°00} 2 0°33| 2 O'17| 2 0°33 7 
7 5 o-71| © Org4| © O14] 7 0°14 es i 
8 5 063| r og] r o-1g 0°00 ie 5 
9 I Ovll 0°00| © Orr! 0°00 
10 0-00 0°00 0-00} 7 0°10 
II 0°00} r 0-09 0-00 ee nea Ree 
12 0:00 iia 0°00 was +z I 
13 I 0°08 sae 0°00 | 
14 I 0:07 aa I 0°07 
15 0:00 tes I 0°07 
16 2 O'1g 
22 e. I 
Total for all D ee ra ia 123 211°48| 66 122-11 | 709 
Total excluding | 460 324°63| 105 72°:94|47 30°43| 87 67-48) 43 30°11 
Dwm0.° 2.3 
True— 
Sum of squares hes ete aes 152°O1 81-85 2196-0 
Mean square . ani 7 ie 0°6355 0°5870 | 2:9306 
False— 
Sum of squares | 940°45 213°04 130°*41 152‘O1 81-85 
Mean square . 2°8137 2°8375 4°2023 21694 2.6351 





























spread will be the contamination. As noted above, the average 
duration of a forage varies greatly with the type of visitor. For 
example, under these conditions, hover flies would produce more 
widespread contamination than hive bees. 
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Theoretical 


The frequencies in table 1 may be used to test the validity of 
various formule which might express the flight of foraging insects. 
It is first necessary, however, to decide which formule are sufficiently 
appropriate to warrant consideration. Various formule have been 
previously proposed for similar phenomena, some on empirical 
grounds, some on theoretical grounds and some, after being derived 
theoretically, have been tested against actual data. 

An example of the purely empirical approach is that of Wadley 
and Wolfenbarger (1944) who observed the dispersal of the Smaller 
European Bark Beetle from a centre. They found that the regression 


equation n = a+, log D+ s gave a good fit with the data. 


In contrast, is the theoretical method of Pearson and Blakeman 
(1906) who considered the distribution of mosquitoes round a point 
source after r flights, on the assumption that all flights were of equal 
length (/) though random in direction. They found that as r increased 
the distribution approached a normal distribution. When r was 

—p* 
greater than 7 the theoretical formula would be n = & iad 
where Q is the total number of insects. 

The same relationship was assumed by Frampton, Linn and 
Hansing (1942) who were concerned with the spread of viruses of 
the yellows type by means of leaf-hoppers. They likened the move- 
ment of leaf-hoppers over a crop to the two dimensional diffusion of 
a gas according to kinetic theory. The distribution of leaf-hoppers 
round a point source would then be a normal distribution the variance 
of which increased With time. No direct test of the validity of this 
hypothesis was made, but they derived from it a formula for the 
spread of virus which showed good fit with their observations. 

Brownlee (1911) offered various criticisms of Pearson and 
Blakeman’s theoretical formula for insect flight. On the basis of 
these he proposed a formula of his own n= ge”. This formula 
was tested against data for the distribution of water-fleas and winkles 
at intervals after their liberation from a point source, and also for 
the distribution of epidemics both in space and time (substituting 
t for D). All these divers phenomena appeared to agree with 
his formula. 

From the above, it would seem that two basic formule merit 
serious consideration to decide their suitabilities for accounting for 
the above observations on the flight of foraging insects. These can 


be expressed in a common form, viz. n = ge~“?". According to 
Pearson and Blakeman, and Frampton ¢é al., x = 2; according to 
Brownlee, x = 1. Other values of x might also be considered. On 
empirical grounds it was felt desirable to test the fit when x = }. 
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These formule can be tested against the data by the method of 
regression. This is done by taking logs throughout. 


Thus logy) 2 = logy g—¢ logyy ¢. D* 
or log n = a+bD*. 


We can now estimate the regression of log 2 on D* for various 
values of x. The goodness of fit of several regressions can be compared 
by subtracting the sum of squares due to the regression from the 
total sum of squares for log n. The smaller the remainder sum of 
squares (which have the same degrees of freedom) the better the fit 
of the regression. The ratios between the remainders are treated as 
variance ratios and their significance estimated by reference to the 
usual tables. 


Fitting the data 


Subject to a few necessary modifications of the data, it is now 
possible to proceed with the estimation of the regressions for each 
of the six sets of observations given in table 1. These modifications 
are required owing to the fact that when n = 0, log n = — oo, which 


TABLE 2 
The regression of n, the frequency distribution of insect flights, on D*, D and Dt 


The sums of squares are the remainder sums of squares after the estimation of the 
regression. The smaller the remainder sums of squares the better the fit of the regression. 
The variance ratios are the sum of squares for the regression divided by the sum of squares 
of the regression of D+. 


























Regression of D? D pt 
log n on 
N 
Seri Sum of | Variance} Sum of | Variance| Sum of | Variance 
vale squares ratio squares ratio squares ratio 
Hive bees . 1943 | 1°475948| 7-22 0°456561| 2°23 0°204384| 1:00 8 
Solitary bees 1943 | 1°437808| 4°35 0°556384| 1°68 0330863 1-00 7 
Hover flies . 1943 | 2°551971| 2°74 1°467272| 1°58 0°930856! 1-00 8 
Hive bees . 1944 | 2°793632| 7:28 0°744612| 1°94 0°383599| 1:00 7 
Solitary bees 1944 | 2°484920| 5°66 0°860717| 1:96 0°438819| 1-00 6 
Bumble bees 1945 | 1°881934| 7°63 0*494034| 2:00 0°246597| 1:00 6 
Weighted mean eee 5734 a 1895 as 100 42 
variance ratio of 
all above series 
P. very small P. 0°05 























is useless for the purposes of a regression. Only the smaller values 
of 2, therefore, can be used. As will be seen from table 1 there are 
instances where n = 0 and adjacent values of n on both sides are 
positive. Arbitrary values have been given to nz of o-10 in columns 
2 and 3, and 1 in column 6. The bias introduced cannot be serious 
U2 
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as all six sets of data gave similar regressions. .These are given in 
table 2. A correction is required in the bumble bee data where all 
the frequencies except for D = o are the sum of two classes left and 
right. The frequency for D = o has therefore to be doubled before 
calculating the regressions. 

In all six sets of data the regression of log n on D? gives 
the best fit. A graphical representation of the fit of the three 
regressions can be made by calculating the means of the observed 
and estimated values of log n for the first five sets of data over 
the range of D from 1-6 inclusive. (The bumble bee data do not 
give values of nm over this range.) The means are then converted 
to their antilogarithms. The values are shown in table 3 and 


TABLE 3 


Comparison of observations with the expectations according to the 
regressions of log n on (a) D*, (b) D, (c) Dt 


The values of log n shown are the mean values for all data shown in table 1 except the 
bumble bee. The values of n are the antilogs of the mean log n’s. Expectation (c) is seen 
to be closest to the observed values of both log n and n. Discrepancies between expectation 
and observation at low values of D are exaggerated when n is calculated. 











D | I 2 3 4 5 6 
Observed | r69¢ 0g? 0453 T8399 1657 3497 
| 
Log n | Expected (a) . 1*138 1003 0*779 0°465 0-061 1-567 
(d) | — 1061 0°658 0°255 1-852 1-450 
(c) | 15! 0°937 = -0°493- OI'IQ 1-789 = 1-491 
canner Gaeecreeneameinn Hana 23 
Observed | 494 = -13°6 2°84 0°690 0454 0274 
n Expected (a) 13°7 10°07 6-o1 2°92 1°15 0:369 
(d) | 2g'1 II*51 4°55 1°80 o-711 0282 
(c) | 32° 8-65 grr 1°32 0°615 0°310 











graphically in fig. 3. Fig. 35 exaggerates the discrepancy between 
expectation and observation for D = 1 in regressions on D* and D. 
The regression on D? corresponds to a normal distribution as proposed 
by Frampton e¢ al. and Pearson and Blakeman. The regression on 
D is according to Brownlee’s formula. The magnitude of the peak 
at short distances appears to be even higher than suggested by Brownlee 
and is fitted still better by the regression on D}, 


The effect of several flights 


In the bumble bee data each flight was recorded in strict sequence 
and the direction was also observed. This permits a further analysis 
which has bearing on contamination. If the flights were completely 
at random the variance of the distribution after r flights should be 
r times the variance after one flight. It has been shown above that 














3 
, 
t 











CONTAMINATION IN SEED CROPS 313 


the bumble bee flights were not entirely random in direction. It is 
obviously desirable to know to what extent this non-randomness will 
affect the variance after r flights, which the bumble bee observations 
allow us to estimate. The data consisted of three portions each 
consisting of observations on a single bee during one forage. The 
first portion was very small. The second and third were of approxi- 
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Fic. 3.—Fitting insect flight to a formula. (a) Log n, (6) n plotted against D, where n 
is the frequency distribution in one dimension. The thick black line represents the 
pooled data of table 1, except for the bumble bees. Other lines represent the expecta- 
tions according to the regression of log n on D? (broken line), D (entire line) and D* 
(dotted line). 


mately equal size comprising 295 and 397 flights respectively. It 
was evident that the degree of dispersal of the second bee was much 
greater than that of the third. The two portions were therefore 
considered separately for their distributions of n,, given in table 4. 
2nD? 
an 

for grouping (—-0833). The number of degrees of freedom is 2, 
as before. 





The mean square is calculated as with Shephard’s correction 


The ratio - is fairly constant within each series, but showing a 
r 


definite upward trend with increasing r. The correlation coefficient 
between MS and r is actually 0-9982 in the second series and 0-9984 


in the third series. The tendency for —— to increase with r is a con- 
Tr 


sequence of the positive correlation already detected between the 
direction of successive directed flights. It will also be seen that as r 
increases n for D = o becomes smaller than n for D = 1. This would 
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not happen if flights were completely random.. The general effect 
is, however, a relation between MS and r according to expectation on 
the assumption of randomness. 
TABLE 4 
Distribution (irrespective of direction) of burzble bees after r flights 
The mean squares have been corrected for grouping 

















: Mean 
D in feet o I es 3 4 § 6 9 B-g t6 3°48 square MS/r 
r 
and Series} 1 | 195 73 12 7 5 I I 13487 1-3487 
2/195 102 28 12 11 2 I 1 1 | 2°8587 | 1-42 
3| 93 114 40 17 15 6 1 2 1 t 2 | 4°7984 | 1597 
4} 73 112 43 25 17 9 I 3 2 3 | 66143 | 1-6536 
5 | 60 105 41 go 22 13 e284 2 3 | 88064 | 1-7613 
Correlation coefficient between mean square and r=0-9982 
D in feet te) ao 2. wer foe 
r 
grd Series| 1 | gor 89 6 0°2021 | 0:2021 
2| 233 144 18 0°4635 | 0:2318 
3 | 188 179 27 6 0°7670 | 0°2557 
4|161 177 42 13 10923 | 0°2731 
5 | 138 175 60 17 2 1°44.73 | 0°2895 
Correlation coefficient between mean square and r=0-9984 























This enables us to obtain from the expressions relating n and D 
a more general expression concerning the distribution of n,. The 
usual expression for a normal distribution is 





t —(=—p)* 
Rae 
oV2n 
The formula herein used for the normal distribution when r = 1 is 
n= ro 


As it has been shown above that in bee flight the variance, o?, 
is proportional to 7, it will be seen that there is the following cor- 
respondence between the symbols used in the two expressions. 


(x—p) :D 


20% :" 


c 
vif 
pe 

7 














is 


r- 








CONTAMINATION IN SEED CROPS 315 


Thus to derive a formula for an n, which is normally distributed, 


we obtain 
—<¢D* 


r 





n, =. 


rt 


Similarly, allowance for r can be made in the general formula 


my = ge 
yw 
Then n, = Le r 


Tx 


The effect of the factor im is to compensate for the change in 
Tx 
slope of the distribution with varying numbers of flights, thus keeping 
the total number of insects constant. 


3. INSECT-POLLINATED CROPS 
Theoretical 


Formule have been derived which appear to express adequately 
the distribution of foraging insects. It is now necessary to see how 
these can be applied to the distribution of contamination. 

The general formule for the distribution of insects after r flights 
from the source can be taken also to express the relative frequency 
with which insects visiting a plant at distance D from a contaminating 
plant have taken r flights to cover the distance. If the symbol for this 
frequency is W,, then ZW, for all visits to a given plant will equal 
unity and it is necessary to adjust the factor g accordingly in the 
expression for n, given above. If there are many contaminant plants 
there will be an increase in the magnitude of WV, for small values of r 
at the expense of the larger values of r. Thus g will be increased 
and ¢ decreased in such a manner as to keep XW, equal to unity. 

At each visit an insect will deposit some pollen already on its 
body and replace it with a fresh supply. In this way the contaminant 
portion of pollen carried by an insect will decrease with every visit 
subsequent to its leaving the contaminant plot. If is the proportion 
of contaminant pollen on an insect leaving the contaminant plot, we 
can consider it to decrease to a fraction w at every subsequent visit. 
The proportion of contaminant pollen on an insect r flights away from 
the plot will therefore be vw’-1. In self-incompatible crops the 
effective r will be the number of flights up to the first visit to the 
plant now being visited, for successive visits to flowers on the same 
plant, though producing a decrease in the amount of compatible pollen 
carried, will not alter the proportion of that pollen which is con- 
taminant. In self-compatible crops, on the other hand, r will be 
the total number of flights. It will include or exclude the last, 
according to whether the insect picks up pollen from a flower before 
or after touching the stigma. 
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The expressions for the frequency of flights and the proportion of 
contaminant pollen on an insect can be combined to give the following 
expression :— 





vgw’-! —<P* 
>= = “Bs 


re 





p, is the contaminant portion of the pollen on a stigma deposited 
by insects which have travelled in r flights from the source of 
contamination. 

The contaminant portion of all insect-deposited pollen will be 
2p, for all values of r from one to infinity. There is frequently, 
however, some pollen deposited on the stigma of the same flower 
without first being picked up on to an insect’s body. The proportion 
of contamination in the seed, F, will therefore be (1—q)2p, where q 
is the proportion of compatible self-pollen deposited without the aid 
of insects. In self-incompatible species and many self-compatible 
ones with an out-breeding mechanism such as protandry, q = 0. 
In crops such as the French bean gq is almost equal to unity. The 
expression for contamination then becomes, by removing all factors, 
independent of r to outside the 2 : 

ww * —eb* 
F = v(1—9) ( —e?* ) 
1% 


r2=I-o 





The part of this expression included within the 2 sign cannot 
be simplified, but for any given set of conditions the expression 
vg(1—q) can be treated as a single constant. The manner in which F 
varies with D can be observed if we attribute arbitrary values to 
the constant, calculate the contributions to F by various values of 1, 
and sum them. 





Let p= g=(t—¢) =c=*s=!1 
Let w=0'5 

= 

er 
Then F = Z—- 

Q”- "7 


Table 5 gives the values of p, and log p, for the range of r from 1 
to 8 and of D from o to 25. From these are calculated 2p, and log 
=p, for r ranging from 1 to 8. The values of log p, and log 2, which 
is equivalent to log F, are shown graphically in fig. 4. 

It will be seen that though the relation between any log p, and D 
gives a straight line, the line for log 2p, is a continuous curve becoming 
parallel successively to each log p, curve. 

So far all the formule have been obtained on the assumption 
that the distributions of insects and contaminant pollen can be 
expressed as continuous curves of infinite extent. In other words it 
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has been assumed that in practice there are no maximum values of 
rand D. Are these assumptions true, and if not do the derived 
formule lose their validity ? 









































TABLE 5 
a 
Expected values for pollen distribution according to the formula p, = an’ r and 
P , D 
its equivalent log p, = —[¢-1) 0-3010+log r+ 7 04343] 
D ° 5 10 | 15 20 25 
r 
Log p, I ° 3829 5-657 7-486 5°314 11-143 
2 1-398 2-312 3°227 4141 5°055 7-969 
3 2-921 2196 3°475 4°749 4°024 5*303 
4 2-495 3952 3°409 4866 4324 5781 
5 2-097 3663 3-228 4°794 4360 5-926 
6 3717 3°356 4992 4°631 4°271 5-906 
7 3°349 3°041 4°728 4°420 4°107 5-798 
8 4990 4°716 4447 4173 5904 5-631 
pr I I 0:0067 007005 0:0%000 0:0f000 | 0:05000 
2 0°2500 0:0205 0:07169 0:0°138 o-0f114 0:0°093 
3 00834 0°0157 007298 005562 0:0°106 0-0201 
4 0:0313 0:0090 0°0°257 0:0°735 o-0%211 0:0'603 
5 00125 0-0046 007169 0-0°622 0:0°229 004842 
6 0°0052 00023 0:07098 0709428 0:0°187 004805, 
7 0:0022 o-o0o!1! 0:0°053 0:0°263 0:0°128 0:0'629 
8 0-0010 0*0005 0:0°028 0:0°149 0:0°080 0:04.27 
range of 
r up to 
ap 2 1°250 00273 007173 0:0°91399 | o-of114 0:0°093 
4 1°366 0°0519 0:07728 0°07144 0°0°328 004814 
6 1-383 0:0588 0:07995 007249 0:0°743 0:0°246 
8 1-386 0:0604 0:0108 007290 00°95! 0:0°352 
Log Lp 2 0°097 2-436 3238 4°142 5°055 7969 
4 0°135 2-715 3862 3°157 4515 | 591 
6 O141 2-769 | 3998 3°395 4871 4°391 
8 0°142 2-781 2-032 3-462 4°978 4°546 
| o't5 3-80 3-05 3°53 3°07 4°65 
| 

















The values for log Xp for r up to infinity are obtained by visual extrapolation of graphs 
showing the effect of increases in r on log Zp. The index numbers to the o’s signify the 
number of times they are to be repeated. 


It is possible that there is a maximum length to single flights, 
i.e. that there is a maximum range of D when r= 1. When, as in 
some wild species, plants are very widely spaced, this maximum is 
likely to limit the distance over which cross-pollination takes place. 
In a seed crop, however, the closeness of foraging is so great, that 
the frequency of such a maximum flight length being exceeded, if 
the distribution were continuous, is insignificant. Where plants are 
growing densely, therefore, the discrepancy between expectation and 
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fact due to the existence of a maximum flight length would be 
negligible. 

The existence of a maximum value of r beyond which contaminant 
pollen would not be carried depends on whether pollen is carried 
forward from one forage to the next. If pollen is not carried forward 
then the maximum value of r will be set by the number of flights per 
forage. This is already known to vary according to the species of 
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Fic. 4.—Variation of log and log F with D according to the equation 
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I 
bp = a ° 


Curves for p are shown for values of r from 1 to 8. The log F curve (thick line) is that 
estimated as log Zp for r ranging from 1 to ©, 


insect and is probably affected by other factors such as the amount 
of nectar obtained per visit. The effect of a maximum r would be 
that the curve relating log F to D would become parallel to the p, 
curve for the maximum 7. If successive forages by an insect continue 
' from where its previous ones finished, and there is a complete carry 
forward of pollen from one forage to the next there will be no 
maximum r. If there is an incomplete carry forward of pollen the 
contribution due to the higher values of r will be correspondingly 
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reduced. Even if there should be a maximum r it is likely to be 
relatively large for most pollinating insects. 

A further possibility is that consecutive forages are incompletely 
correlated in position or even uncorrelated. This could only have an 
effect if pollen were carried forward from one forage to the next. 
The result would be a flattening of the curve relating log F and D 
for the higher values of D, the degree of flattening being increased 
by decrease in the spatial correlation between forages and increase in 
the amount of pollen carried forward. If there were no correlation 
between forages the log F curve would eventually become parallel 
to the D axis resulting in a minimum contamination. This, however, 
is an impossible extreme as there must be some correlation between 
forages if only because bees will remain within a certain distance 
of their hive. The presence of wandering insects (not foraging 
closely) as postulated by Butler e¢ al. (1943) would also flatten the 
curve at greater values of D. 

It has been implied by Butler (zbid.) that there is a further limitation 
of insect flight—that foraging tends to be restricted to a given area. 
This would involve a maximum D, equal to the diameter of the foraging 
area, irrespective of the number of flights, r. There does not appear 
to be any evidence of this phenomenon and it need not therefore be 
considered further. 

There is a possibility of secondary contamination arising through 
insects picking up contaminant pollen from a contaminated flower. 
The effect would merely be to reduce the slope of the log F curve 
at all distances, and the appropriate formula would be the same. 

Even though all the above considerations involve speculation about 
circumstances not yet understood, none necessitate any radical changes 
in the formula for F as originally proposed. This can be restated in a 
simpler form, using A as a constant for any particular pair of con- 
taminant and contaminated plots. 


uf ~* —<D* 
rs 


F= kz 





- 
rx 


rei @ 


The evidence of the insect flight data is that the most appropriate 
value for x is 1 or 4. 


Empirical 


The next problem is to decide on an empirical formula which will 
closely parallel the effects of the above fundamental formula. As 
can be seen from fig. 4, when the theoretical log F is plotted against 
D a curve is obtained which deviates by upward curvature from a 
straight line, the deviation being the effect of successively higher 
values of r. Fig. 3 shows a very similar curve corresponding to the 
regression of log r on D* which is derived from the equation n=ge-?. 
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The corresponding expression for F is yet, A similar. upward 


curvature would be a feature of the regression equation 
-kD 
e 


log F=a—bD—log D. This corresponds to F =~ 


These empirical formule can be tested for conformity with the 
theoretical formula in the following way. From the values in table 5 
one can calculate the values of log Zp for the following ranges of r: 
I—2; 1—4; 1—6; and 1—8 for values of D from 5 to 25. (D=o 
is an impossible value for actual cross-pollination and the corresponding 
value of log F is in another sense unreal, being positive, which would 
make F>1.) It is then possible to construct graphs for each value 
of D showing the effect of increasing the range of r on log Zp. By 
extrapolation of these graphs we obtain the values of log Zp to be 
expected when the range of r is complete (i.e. 1-00). This method 
is obviously subject to personal error in the extrapolation. However, 
it is now possible to treat these estimates of log Zp as “ observed ” 
log F and to test the goodness of fit of the two empirical equations 
by the method of regression. 

For the equation, F=ye~*”! the appropriate regression is that of 
log F on Dt, for the values of D from 5 to 25. The total sum of 
squares for log F is 2-8288 whilst the sum of squares due to the 
regression is 28279, leaving a remainder sum of squares for 3 degrees 
of freedom of only 0-0009. The fit between the fundamental and 
empirical equations is thus very good, though the extremely small 
remainder may be deemed a coincidence. 


ye -—&D 


Similarly, for the equation F = the appropriate regression 





is that of (log F+log D) on D. The total sum of squares of 
(log F+log D) is 1-2875, whilst the regression sum of squares is 
1-2816, leaving a remainder of 0-0059 for 3 degrees of freedom. This 
still represents a good fit. The difference between the two remainders 
is not significant. It must be concluded that both empirical equations 
give a good fit with the fundamental equation and no choice can be 
made between them at this stage. 

Comparison of these empirical formule with the original shows 
that the coefficient » corresponds to the part outside the 2 sign, 
namely vg(1—q). These are respectively the proportion of con- 
taminant pollen on an insect leaving the contaminant plot; a 
constant derived from the insect density formula which decreases 
with increasing dispersal of the insect flights; and the proportion 
of compatible pollen which is deposited by insects. The compound 
coefficient y expresses the contamination at zero isolation distance. 

The coefficient k corresponds to those parts inside the 2 sign, 
namely w, c and x. These are respectively the diminution of con- 
taminant pollen with each successive forage; the inverse of the 
degree of dispersal of insects (i.e. the closeness of foraging) ; and the 


















CONTAMINATION ‘IN .SEED CROPS 321 


power of D which expresses most closely the effect of D on insect 


foraging. The degree of dispersal of the insects, -, is determined 


largely by the way in which the plants are arranged, whether densely 
or sparsely, for example. The compound coefficient k expresses the 
rate of decrease of contamination with distance. 

Before proceeding further to test these formule by comparison with 
the data on contamination, attention should be paid to any formule 
hitherto proposed, which might have a bearing on the question. 
From their formula for the distribution of leaf-hoppers, Frampton 
et al. derived a formula for the spread of virus disease. The assumption 
was that the infection J was directly proportional to time ¢, and to 
the number of insects feeding. On this basis the following formula 


was obtained :— 
I = yte*!-*2 


This situation must inevitably differ from cross-pollination, for 
an insect vector can continue to produce a new infection with each 
fresh bite, whereas in the process of contamination the pollinating 
insect steadily loses its ability to contaminate. The time factor is 
therefore of no importance (provided, of course, that the contaminant 
and contaminated varieties are synchronous in flowering period). 
If we are to apply the formula for virus infection to contamination by 
cross-pollination we must adjust the expression to F = ye”, 

This formula is supported by the experimental evidence of 
Currence and Jenkins (1942), who observed the effects of distance 
on contamination in a cross-pollinating variety of tomato. They 
found that a good fit with the results was obtained by using the 
regression equation log F=a+bD where } was negative. This is 
merely the logarithmic form of the above modification of Frampton’s 
formula. 

There appear, therefore, to be three equations which merit testing 
for their agreement with observation : 


F=a 





J 
F =z yet 
-kD 
F=~ 
D 


Fitting the data 


We are now in a position to test these formule. Fig. 5 shows 
log F plotted against D for a varied collection of contamination 
experiments described elsewhere (Crane and Mather, 1943 ; Bateman, 
1947 a and 6). As zero contamination gives an infinitely negative 
logarithm it has often been necessary to pool the data in groups of 
adjacent distances so as to give a finite logarithm. The mean D for 
the group is then used in the graph. 
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Fic. 5.—Contamination in insect-pollinated crops. Log F is plotted against D in feet. 
(a) Radish (Crane and Mather, 1943), (4) turnip, and (c), (e) and (f) radish (Bateman, 
19474), (d) beet (Bateman, 19475). 
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It will be noticed that fig. 5 (d) concerns beet, which is generally 
regarded as wind-pollinated. It has been included because in the 
experiments it was found that the beet flowers were visited for the 
nectar discs by numerous flies. In large fields insect-pollination of 
beet is not likely to be important. 

-kD 

The suitabilities of the expressions ye~*”, yet, and ~ can be 
compared by the method of regression. On taking logs these become 
log y—k log e.D, log y—k log e.D*, and log y—k log ¢.D—log D 
respectively. We can therefore estimate the following regressions :— 


log F on a—b,D 
log F on a—b,D* 
and log F on a—b,D—b, log D 


where a is an estimate of log y, and 5, is an estimate of k log e. 
According to hypothesis the coefficient of log D in the third regression 
is unity. Furthermore a comparison of regressions can only be made 
if they are based on the same number of degrees of freedom, in this 
case, one. The third regression is therefore modified to that of 
(log F+log D) on a—bD. The sum of squares due to each regression 
is subtracted from the total sum of squares.of log F or (log F+log D) 
and the smaller the remainder sum of squares the better the goodness 
of fit obtained by the regression. 

The remainder sum of squares for log F is equivalent to the (log F 
observed —log F estimated) .? 

The remainder sum of squares for the regression of (log F-+-log D) 
on D is therefore Z[(log F observed+log D) —(log F estimated+log 
D)]|?. The log D’s cancel out giving 2(log F observed—log F 
estimated)*. The three remainder sums of squares are therefore 
strictly comparable. Table 6 shows the remainder sums of squares 
of the three regressions for all the data shown in fig. 4. There is 
never any significant difference between the second and third 
regressions. Except in series (¢), which concerns distances of 7 feet 
and less, the second and third regressions have a distinct advantage 
over the first. It has been postulated above that the higher values of 
r (which produce the upward curvature of the log curve) become 
of increasing importance as D increases. It is not surprising, therefore, 
that the straightest curves (those fitted best by the regression of log F 
or D) are obtained for the shortest distances. 

The data on the effects of distance on the spread of contamination 
are thus in conformity with two hypotheses :— 


(1) That bees and other pollinating insects forage in a manner 
—cD* 





which is described by the expression Zz. . A better fit is given 


Tx 
when x = } than when x = I. 
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(2) The amount of contaminant pollen carried by an insect 
decreases regularly with successive visits to flowers of the other 
variety. 


TABLE 6 


The remainder sums of squares for three types of regression for the 
contamination data represented in fig. 4 


In all series the variance ratio is calculated, for the sake of an easier comparison, using 
the sum of squares remaining after the regression of log F on D* as the denominator. 
The variance ratios for (a)-(d), (¢) as a whole and (/) as a whole are the weighted means 
of the variance ratios of the parts. The probability for the ratio of the “ D” remainder 
to the “ Dt,” remainder is given for these pooled ratios. 















































. F ( F+ Log F 
R Log 
— on D log D) on D on Dt 
: Degrees | Sum of | Variance Sum of | Variance| Sum of | Variance 
Series of 

freedom | 84uares ratio squares ratio squares ratio 
a). ‘ , 18 0°427978 | 4r4r 0*361499 rig 0°304014 1-00 
6). : ‘ 7 0°567747 3°22 0°238632 1°35 0°176307 ro 
6) . : . II 1*4526 1°64 0°699272 0-79 0°887747 100 
d). ; ; 12 1166501 | 2°28 0°460055 0-90 0°511037 100 
(a) to (d) , 48 P=o-01 1'944 1049 1000 
e) EA 12 0:286398 | o-82 0°375506 1:08 0347283 100 
e) EB 6 0:060359| 0°74 0°101670 1°25 0:08133 1°00 
e) WB . 6 0°070510| 103 0°043749 064 0:06864) 1:00 
(e) WC . 4 0°050497| 1°22 0°043852 1:06 0°041465 1°00 
(e) asa whole . 28 P large 0°905 1019 | 1000 
t¥ W 10 0°399517| 2°89 0°158750 reg eee | 100 
is) ae . 9 1544541 | 1°97 0°516786 0-66 0782717 1:00 
fy Si. ‘ 6 1272825 1°69 0°754156 1:00 0°753! 100 
pan | * 4 0°794073| 1°84 0+286966 0°66 0°432 1:00 
(f) as a whole. 29 P=o0-01 | 2-2Ir | 0-899 | | 1000 














In accordance with these hypotheses the relation between con- 
tamination F and distance D can be expressed empirically by either 


—-kD 
of the equations F = yet or F =? 55 . When the distance is 





not more than a few feet the relation is equally well expressed by 
F = ye”. 


4. WIND-POLLINATED CROPS 


(a) Pollen dispersal 
Theoretical 


A detailed discussion of the mechanics of the dispersal of air-borne 
fungal spores has recently been published (Gregory, 1945). Basing 
the arguments on formule proposed by Sutton (1932) for the dispersal 
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of a smoke cloud from a point source, a formula has been obtained 
for spore dispersal. The main feature of the discussion is the 
importance attached to atmospheric turbulence as a dispersing agent. 
The following formule are given :— 


(2) for deposition when the rate of deposition has a negligible 
effect on the amount of spores in suspension. 


p= - 2 


TEC DIO +) 


Where p = the mean number of spores deposited per unit area in all 
directions from a point source. 
z = the proportion of suspended spores deposited as a spore 
cloud passes over unit area. 
Q = the total number of spores. 
D = the distance from the point of liberation of the cloud. 
C = an atmospheric constant. 
m = a factor varying between 1-24 when turbulence is at a 
minimum and 1°76 when turbulence is at a maximum. 
(b) for the change of total suspended spores as deposition proceeds 


| 


Q,>= o fies 


Combining the two expressions we obtain the complete formula for 
the dispersal of air-borne spores. 

Modification of this formula are deduced to account for deposition 
down wind from a point source (p,,) and deposition down wind from 
a line source (f,,,). 





_ 2%Qp 
These are i, = ap» 
_ 22Qp 


where Q , is derived from Q , as above. 

Doubt has been thrown by Bosanquet and Pearson (1936) on the 
practical importance of eddy diffusion or atmospheric turbulence. 
They assert that when there is a continuous source and observations 
are made over a period of time which allows for considerable variation 
in wind velocity, wind variation predominates over eddy diffusion in 
determining the resultant dispersal. If attention is paid to dispersal 
in all directions no account need be taken even of wind direction. 
Over a given period, then, during which Q pollen grains are liberated, 
ignoring loss through deposition, the grains will pass outwards in two 
dimensions under the influence of wind so that the number of grains 
Passing over unit circumference at distance D from the source will be 
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* Allowance can be made for deposition by assuming that the 
7. 


amount of suspended grains is reduced by a certain proportion per 


unit distance travelled from the source. 

Then p = — or in general form e. 

27D D 

This formula is the same as is obtained from Gregory’s expression 
by making m equal to zero. Taking logs this equation becomes of 
the form log p = a—b,D—log D. This means that when log # is 
plotted against D a line will be obtained curving upward from a 
straight line of negative slope. 

Another formula for eddy diffusion in a stationary cloud has been 
proposed by Schmidt (1925). This is only the normal distribution 


in another form. 
—p? 
“4a 
a (#) 
P= onl Ant 
p 





where # is the spore density at distance z from the source at time ¢, 
Q is the total number of spores, and A and p are atmospheric constants. 
It may be written as p = ge~*”". 

If Schmidt’s expression is used and allowance is also made for 


deposition rate 
p ae ae 


If log is plotted against D, this equation will give a line 
curving downward from a straight line with negative slope 
(log p = a—b,D—b,D*). Reference to fig. 5 shows that where 
definite curvature from a straight line is discernible it is always in 
the upward direction. This equation may therefore be dismissed. 

We are left then with the formule proposed by Gregory and the 
modification which ignores the effects of turbulence (assumes m = 0). 

Gregory’s formula can be simplified in the following manner :— 

—-p*—™) 
etm) 


ees 


where g varies with the amount of pollen liberated and the rate of 
deposition, ¢ varies with rate of deposition and m varies with the 
turbulence. 


Fitting the data 


If observations are available for a sufficient number of distances, 
so that the number of degrees of freedom will allow a comparison, it 
is possible to compare the adequacy of formule for p by the method 
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of regression. Gregory’s formule can be made suitable in form for 
regressions by taking logs. 


Thus log p = log g—(1+4m) log D— cD®-*™, 
Hence log p+-(1-+4m) log D = a—bD*-™ 
Similarly 

log p,,+m log D = a—bD*-*” 


and log pry + = log D = a—bDe-™ 


The maximal and minimal values for m (%-76 and 1:24 respectively) 
can be inserted and the regression of the left-hand side on the right- 
hand side calculated. The corresponding regression equations on the 
assumption that turbulence is without effect can be obtained by 
making m = 0. 
These are 

log p+log D = a—bD 

and log p,, = log p,,, = a—bD 


As has been shown earlier, the sum of squares of the deviations of 
observed (log +log D) from expected (log +logD) is the same as 
the sum of squares of the deviation of observed log from expected 
log p, as the log D’s cancel out. The remainder sums of squares for 
any of these regressions applied to the same data are therefore strictly 
comparable. 

Incidentally the method of regression makes possible the estimation 
z 
C 
assumed value of m provided that all measurements are expressed 
in the same units. 


of the values of = and Q,, in Gregory’s equation corresponding to any 














_ 2z loge Mame «oe! bat (1 —4m) 

ad ~ wtC(1—4m) ** Co aloge * 
= ss " oe __ antilog a.27?.log e 
Similarly a= log (2) — = a aa 


Material suitable for the estimation and comparison of regressions 
is available in the maize data from a previous paper in this series 
(Bateman, 19475) as shown in fig. 6a, and the data of Jensen and 
Begh (1941) for rye (fig. 60) and cocksfoot (fig. 6c). A great deal of 
data for fungal spores is presented by Gregory, but it is mostly unsuitable 
for our purpose as either the number of observations is too few or the 
distances are expressed too vaguely. 

In the maize data p is the total deposition in three directions at 
right angles to one another over a period during which the wind 
varied considerably. The most appropriate regression should there- 

K2 




















O 4 
COCKS FOOT 





Fic. 6.—Dispersal of pollen by wind. Maize data from Bateman (1947b); rye and 
cocksfoot from Jensen and Bogh (1941). The units of D are 1o feet for maize and 
100 metres for rye and cocksfoot. Heavy lines represent the observations. Other lines 
represent the expectation according to regressions based on the following assumptions :— 


maize ; when m = 0, entire line rye and cocksfoot ; $;,, when m = 0, entire line 
p when m = 1°76, broken line Po When m = 1°76, broken line 
p when m = 0, dotted line. 
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fore relate to dispersal in all directions from a point source. The 
following are the estimated regressions :— 





Remainder 
sum of 
squares | Variance | Probability 
(5 degrees ratio 

















freedom 
Not down-wind, point source— 
m=0 pest hel =3°870—0'23104 D 0*121035 4°82 0°05 
pant ae 2 log D=4°662—0-°85024 D®* | 0028951 I°l5 a8 
m=1+76 log p+-1°88 log D=6-573—2-72272 D2 | 0-025129 1°00 








If m is given the values 1-24 or 1°76 a better fit is obtained than 
if eddy diffusion is ignored. The differences between the expectations 
of p and log when m varies between 1-24 and 1-76 are very slight 
as can be seen in table 7. The apparent importance of turbulence is 
the more remarkable in that the actual data suggested otherwise for 
nearly all pollen deposited was on the slide facing towards the source 
rather than that facing away from it. 

All the data presented by Jensen and Begh concern the down-wind 
distribution of pollen. It is not clear, however, whether the field was 
large compared to the distances at which pollen deposition was 























Assumptions Remainder 
Rye equations = eae —_ 
Circum- = (5 degrees — — 
stances of freedom) 
Down- o |Logp =1+723—0-10961 D 0°029504 2°24 o-2 
wind from 
point or 
line source 
Notdown-| o | Log /+log D =1°722-+0:02293 D 0017378 1°32 High 
wind from 
point 
source 
—_ 1:24 | Log p-+-0°62 log D=1-860—0°15189 D8 0013164 1°00 
wind, 
line source 
— 1-76 | Log p+0-88 log D=1-509+0:°20775 D®12 0°013817 1°05 High 
win 
line source 


























recorded. Let us first consider the rye data as down-wind distribution 
from a point source (p,). The left-hand side of the regression 
equation is then log p+-m log D. A preliminary examination of the 
rye data showed that if the value of 1-24 or 1-76 was given to m in this 
regression, the left-hand side increased with D; in other words the 


' a oe lee 
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coefficient } is positive, implying multiplication of the pollen after 
liberation, instead of diminution. If m = 0, however, the regression 
remains plausible. If, on the other hand, the distribution is con- 
sidered as being down-wind from a line source at right angles to the 
wind, the left-hand side of the regression equation becomes log 
p+4m log D and is plausible for values of m between 1°24 and 1°76. 
We may further query the constancy of wind direction during the 
experiment (strict p). The coefficient of log D then becomes $(m+-2) 
and the only value of m which then fits the data is zero. 

The possible regression equations for the dispersal of rye pollen 
are shown on page 329. 

It will be noticed that in two regressions the 6 coefficients are 
positive. When both coefficients are compared to their standard 
errors, however, they give non-significant ¢’s. This means that 
according to the particular regressions, loss of pollen orne deposition 
is insignificant. 

In the cocksfoot data also, on the same grounds there are four 
possible regression equations :— 
































| Assumptions Remainder 
as sum of : 
| Cocksfoot equations squares ee Beis 
Circum- ie degrees Y 
stances - freedom) 
Down- . Log p =1'582—o-12511 D 0*145988 1°53 High 
wind from 
point or 
line source 
Notdown-| o Log p+log D =1'678—0-02212 D 0'099631 1+04 
wind from 
point 
source 
Down- /|1°24| Log p+o0-62 log D=2-044—0°40188 D®* | 0:095554 100 
wind from 
line source 
| Down- 1°76 | Log p+-0°88 log D=2-604—0:93394 D2 | 0*106150 Iv 
wind from 





line source | 




















It will be seen in table 7 that for both rye and cocksfoot the last 
three regressions give very similar expectations of p. 

Summarising the pollen distributions in their relation to possible 
regressions the following statements can be made :— 


(2) With the data as a whole the best fit is obtained using regres- 
sions based on the assumption that eddy diffusion is the 
basis of pollen dispersal. Whether the atmospheric turbu- 
lence is assumed to be at a maximum (m = 1-76) or at 
a minimum (m = 1-24) the fit is equally good. The 
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; corresponding values of z, the rate of deposition and Q,, 
L the number of pollen grains liberated, vary greatly, however, 
. according to the assumed value of m, as can be seen by the 


differences in the estimated values of a and 6 corresponding 
to the extreme values of m. 


TABLE 7 


ed 


—— «ae. ae 


Comparison of observation of log p and expectation according to several 
assumptions (see text) 





D (arbitrary units) | I | 2 


Maize pollen 






































Observations | 3865 3°03 | 2°533 | 2°185 | 2-083 | 1-748 | 1-477 
Expectations 
Not down-wind, point source m=o | 3°639 | 3°107 | 2°700 | 2-344 | 2026 | 1-706 | 1-408 
’ arr 6 Py >> «= M=1'24) 3°B12 | 3068 | 2°599 | 2-247 | 1-962 | 1722 | 1513 
» 99 ‘3 oe »»  m=1-+76| 3°850 | 3°050 or | 1°953 | 2°734 | 1°546 
Rye pollen 
— “| a 
ity Observations 1*704 | 1-488 | 1-308 | 1-198 | 1-247 | 1-061 | 0-986 
Expectations | 
__| —Down-wind, point or line m=o 1°613 | 1°504 | 1°394 | 1°285 | 1-275 | 1-065 | 0-956 


source 
th | # Not down-wind, point source m=o 1°745 | 1°467 | 1-323 | r-azz | 1°197 | 1°081 | 1-037 
\Down-wind, line source m=1:24| 1-708 | 1-476 | 1°334 | 1-230 | 1°147 | 1°078 | 1-018 





























i a ae as m= 1°76) 1°717 | 1°470 | 1°326 | 1-224 | 1°146 | 1-082 | 1-027 
Cocksfoot pollen 
Observations 1+538 | 1-425 | 1263 | 1-080 | 0-644 | 0°740 | 0-700 | 0°617 | 0°473 
= Expectations 
Down wind, point or line m=o 1°457 | 1°331 | 1:206 | 1-08 | 0°956 | 0°831 | 0-706 | 0-581 | 0°3931 


source 
—| || Not down-wind, point source m=o 1-656 | 1°333 | 1°135 | 0°988 | 0-868 | 0-767 | 0-678 | 0-598 | 0°457 
; Down-wind, line source m=1°24| 1°642 | 1°334 | 1°138 | o-gg1 | 0°870 | 0-768 | 0-678 | 0-598 | 0-460) 
mas | oe 8 m=1+76| 1°670 | 1-324 | 1°118 | 0-971 | 0-855 | o- 











760 | 0-680 | 0-610 | 1-493 


























The observed values and those expected values which accord most closely with them 
are shown in clarendon type 


(b) In the two sets of data concerning down-wind distribution an 

equally good fit is obtained if turbulence is ignored and if 
the wind is assumed to be variable in direction. 

(c) Though in every instance the worst fit is given in comparable 
regressions when turbulence is ignored, the fit obtained might 
be good enough for many purposes. 


Before a final decision can be reached on the most appropriate 
regression, which might indicate the most fundamental equation, 











332 A. J. BATEMAN 


it will be necessary to obtain more accurate data over a sufficiently 
wide range of distances. At the present stage the evidence is in 
favour of Gregory’s assumptions regarding the importance of 
atmospheric turbulence. 


(b) Contamination 


The derivation of F, the proportion of contamination, from #,, 
the concentration of contaminant pollen, is much simpler than the 
equivalent derivation of F from n the frequency distribution of 
insects. We can assume that the concentration of non-contaminant 
pollen, #;, is independent of the distance D from the contaminant. 

F 
1—F 


p; is constant Po and — will vary with p,. We may therefore write 


If 





Now F = —/2_ which can also be expressed as ° = 
aor 


poe for contamination in all directions from the source, 


F 


Log 











3 n 1 1 I 
e) 20 40 60 80 
D 


Fic. 7.—Contamination in maize. Log — plotted against D in feet. Data in 


Bateman (19476). The entire and broken lines represent replications. 


where in the simplest case m= 0. There is no need to make any 
detailed examination of the contamination results in the two wind- 
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pollinated crops beet and maize. Since the error variance of the 
contamination is much higher than that of pollen distribution a 
comparison between various regressions will be much less sensitive. 
It has been seen (fig. 5¢) that the beet curve for log F against D shows 


upward curvature. Since F is small the curve for log would be 


very similar. The results are thus compatible with either insect or 
wind pollination in this crop. 


Fig. 7 shows log a plotted against D for two sets of maize 


data from the North and South arms of the same experiment. Only 
those ears which silked before 11th August (which proved to be the 
data at which contamination almost ceased) are represented. The 
curves are very erratic in comparison with curves for pollen con- 
centrations, because whilst in the former each ear represented a 
sample of pollen taken at its own time of silking and with local 
variation in amounts of non-contaminant pollen, in the latter all 
samples were taken simultaneously with only one source of pollen. 
The erratic nature of the results makes the calculation of a regression 
superfluous. By appearance the WN curve is almost straight, whereas 
the S curve shows slight upward curvature. The results are at any 
rate not incompatible with the expectations based on pollen studies. 


CONCLUSIONS 


It is now possible to compare the formule derived for insect- and 
wind-pollination. In insect-pollinated crops two formule appear 
equally suitable : 


kD 
F=ye~2t and F=_ 


D 
In wind-pollinated crops the corresponding formula is 
AES st 
1—F D 


In spite of the very different ways in which these formule have 
been derived, they all give similar results in practice. If F is small 


so that F= — one formula can be used in all cases, viz. : 


ye*P 


Fe 





If this formula is expressed as a graph in which log F is plotted 
against D (see fig. 4) a curve of negative slope is obtained, its steepness 
decreasing as D increases. If the curve relating log F to D were a 
straight line of negative slope the proportionate decrease in contamination 
with unit increase in distance would be constant. The proposed 
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formula means, therefore, that even the proportionate decrease in 
contamination becomes less with each increase in distance. In 
practical terms successive increases in isolation distance become less 
and less effective in improving the isolation. If, for example, con- 
tamination is complete for no isolation distance, and 100 yards reduces 
it to one in a hundred, the contamination at 200 yards will be greater 
than one in ten thousand. 

The practical importance of the proposed formula to seed growing 
would be in the prediction of the contamination at any isolation 
distance provided the contamination at two distances were known 
with sufficient accuracy and that other conditions were constant. 
Under given conditions it would be possible to predict the distance 
at which contamination of one in a hundred or one in a thousand 
would be obtained. 

The formula also has a bearing on population genetics. It gives 
the distribution in a continuous population of the pollen parents of 
the progeny of an individual seed parent. The nature of this 
distribution suggests that if the seed were not widely dispersed, there 
would be a correlation between the proximity of two plants and the 
closeness of their relationship. Even in an infinite and continuous 
population, therefore, gene combinations would spread slowly and there 
would be slight inbreeding. In a population of finite size more genetic 
variability and greater inbreeding would occur than would be 
expected on the basis of population size alone. In the past, the 
occurrence of greater variability than was anticipated from the 
population size has been explained by the assumption of an internal 
discontinuity in the population (Sewall Wright, 1939). A limited 
dispersal of pollen or seed would effectively explain the same situation. 

It is worth noting that in a seed crop which is harvested in bulk, 
mixed mechanically and resown, there is no correlation between the 
proximity of plants in the field and their relationship. Crop plants are 
therefore probably the only examples in which the total number of 
plants in a population really corresponds to Sewall Wright’s WN statistic. 


SUMMARY 


Insects and air produce pollination by very different means which 
can be contrasted as follows :— 

1. Insects move independently of one another, but the experi- 
ments show that they do so in a statistically predictable manner. 
Air, on the other hand, moves in large masses broken up by turbulence, 
the effects of which are also predictable; but the movement is 
modified by variation in wind direction and velocity which is 
unpredictable. 

2. Insects carry pollen systematically from flower to flower of the 
same species : under normal wind conditions, if the species is evenly 
dispersed, the pollen is distributed equally in all directions. Air-borne 
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pollen, on the other hand, is distributed down-wind and alights on a 
stigma without regard to its species. 

3. An insect can carry only a limited amount of pollen available 
for pollination. Consequently the amount of pollen of one variety 
on an insect can increase only at the expense of other varieties. In 
the air, on the other hand, the amount is almost unlimited. The 
atmospheric concentration of pollen of one variety, therefore, has no 
direct influence on the concentration of another. 

Bearing in mind these distinctions, formule are derived for the 
effect of distance on contamination in insect- and wind-pollinated 
crops. These agree well with the experimental results now described. 
The formule for the two classes of pollination are unexpectedly 
similar ; so similar indeed that one can derive a common formula 
for the two. 

With this formula one can use the contamination observed at two 
distances to predict what will be found at a third. The formula 
also throws some light on the breeding behaviour of natural 
populations. 


Acknowledgments.—This work has been carried out under the auspices and financial 
assistance of the Agricultural Research Council. The author also wishes to express 
his gratitude to Dr K. Mather for his guidance during the work. 


REFERENCES 
BATEMAN, A. J. 19474. 
Contamination in Seed Crops. I. Insect Pollination. 
F. Genet. 48, 257. 


BATEMAN, A. J. 1947). 
Contamination in Seed Crops. II. Wind Pollination. 
Heredity 1, 235. 


BOSANQUET, C. H., and PEARSON, J. L. 1936. 
The spread of smoke and gases from chimneys. 
Trans. Faraday Soc. 32, 1249. 


BROWNLEE, J. IQII. 
The mathematical theory of random migration and epidemic distribution. 
Proc. roy. Soc. Edinb. 31, 262. 


CRANE, M. B., and MATHER, K. 1943. 
The natural cross-pollination of crop plants with particular reference to the radish. 
Ann. appl. Biol. 30, 301. 


CURRENCE, T. M., and JENKINS, J. M. 1942. 
Natural crossing in tomatoes in relation to distance and direction. 
Proc. Amer. Soc. hort. Sci. 41, 273. 


FRAMPTON, V. L., LINN, M. B., and HANSING, E. D. 1942. 
The spread of virus diseases of the yellow type under field conditions. 
Phytopathology 32, 799. 


GREGORY, P. H. 1945. 
The dispersion of air-borne spores. Trans. Brit. mycol. Soc. 28, 26. 





















336 





JENSEN, I., and BOGH, H. 1941. 
On conditions influencing the danger of crossing in the. case of wind-pollinated 


Tidsskr. Planteavl. 46, 238. 

PEARSON, K., and BLAKEMAN, J. 1906. 

Mathematical contributions to the theory of evolution. XV. A mathematical 
Drap. Co. Mem. biom. Ser. 3, 1. 


SCHMIDT, W. 1925. 
Der Massenaustausch in freier Luft und verwandte Erscheinungen. 
Probl. kosm. Phys. 7, 1. 


SUTTON, 0. G. 1932. 
A theory of eddy diffusion in the atmosphere. 
Proc. roy. Soc. A, 135, 143. 


WADLEY, F. M., and WOLFENBARGER, D. 0. 1944. 
Regression of insect density on distance from centre of dispersion as shown by a 


Ff. agric. Res. 69, 299. 


WRIGHT, S. 1939. 
The distribution of self-sterility alleles in populations. 
Genetics 24, 538. 


A. J. BATEMAN 


cultivated plants. 


theory of random migration. 


study of the Smaller European Bark Beetle. 





al 





CONTROLLED X-SEGREGATION AT MEIOSIS 
IN TEGENARIA 


S. H, REVELL 
John Innes Horticultural Institution, Merton 


Received 10.vi.47 
1. INTRODUCTION 


OBSERVATIONS made upon a system operating under abnormal 
conditions are often of value in finding out how its normal function 
is controlled. This method of attack has been most profitable in 
cytology. In the present paper an abnormal type of meiosis, apparently 
characteristic of male spiders, is described, and its unusual features 
discussed with particular regard to the general problems of chromosome 
polarisation and chromosome movement on the spindle. 

The Aranee have received little attention from cytologists. Wallace 
(1909) first reported the multiple sex chromosome mechanism of an 
XXO type in Agelena nevia males, and this was confirmed by Painter 
(1914) in twelve widely representative genera. Recently Hard (1939) 
described the same mechanism in Schizocosa crassipes. Both X’s are 
said to be markedly heterochromatic and they do not form chiasmata, 
though they remain side by side throughout the first meiotic division. 
At first anaphase they move together to the same pole. It is the 
adaptation of meiosis to this unusual sex chromosome mechanism 
that makes the material of special interest. 


2. MATERIALS AND METHODS 


Three common British household spiders were examined— Tegenaria 
atrica, T. derhamii and T. domestica. All material was taken from 
the wild. 

Two techniques were adopted. Squashes in Lacmoid stain 
(Darlington and La Cour, 1942), made after fixation of whole testes in 
acetic alcohol, were found to be excellent for the meiotic and spermato- 
gonial chromosomes in all three species. Some mitoses were also 
obtained from ovary tissue in 7. domestica by this technique. To 
follow the orientation of the prophase and metaphase chromosomes 
to the centrosomes at meiosis some material was fixed in 2BD, sectioned 
at 20 p and stained in Crystal Violet. 

337 











S. H. REVELL 





3. MITOTIC CHROMOSOMES . 


In all three species of Tegenaria examined the autosome number 
is forty, but while 7. airica males have the characteristic XXO 
complement that has previously been reported in the Araneae, the 
species 7. derhamit and T. domestica have three-X males. Several 
somatic counts of forty autosomes and the expected number of six 
X’s were obtained from T. domestica females. 

In spermatogonial metaphases the pairs of autosomes and the two 
or three X’s show strong somatic pairing. The autosomes in each 
species are indistinguishable ; they vary little in length and all have 
centromeres apparently terminal. The X’s can be recognised by their 
greater length ; and at meiosis, when they are together, differences in 
their length can also be detected (pl. I, figs. 4, 5, 6 and 7). 


4. MEIOSIS 
(i) General 


The course of meiosis in the three species is essentially the same 
and they are therefore described together. The investigation was, 
however, based primarily upon observations on the two-X species, 
T. atrica, because it was found to be the most favourable material. 

During the pre-meiotic resting stage the autosome number of 
forty pro-chromosomes can be seen, diffusely outlined and evenly 
distributed round the nuclear membrane. The X’s are already 
paired and are strongly heterochromatic, being super-condensed and 
deeply stained. Their outline is clearly defined, in marked contrast 
to the prochromosomes which, if they represent heterochromatic 
autosome segments, are evidently not so heavily charged with nucleic 
acid as the X’s. 

At leptotene each autosome thread has a heterochromatic chromo- 
mere at one end, which lies against the nuclear membrane. These 
chromomeres are evenly spaced at first ; but subsequently they move 
round to concentrate on one side of the nucleus, presumably determined 
by the centrosomes, which are, however, not resolvable at this stage. 
The leptotene threads are otherwise weakly stained and have been 
unorientated hitherto. The effect of this migration of their terminal 
chromomeres is to polarise each thread so that it lies across the 
nucleus and parallel to its fellows. 

When polarisation is completed autosome pairing begins, which 
is rapid and always complete. By the time of pachytene the terminal 
heterochromatin has lost its differential charge. At diplotene one 
chiasma is formed in each bivalent : no variation in chiasma number 
and no failure of chiasma formation was seen. There is a wide range 
in the point of origin of the chiasmata in each early diplotene nucleus, 
but the morphological similarity of the bivalents makes it impossible 
to be certain that this represents a random formation (pl. I, figs. 2-6). 
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There appears to be no terminal movement of the chiasmata during 
prophase. i 

The sex chromosomes do not form chiasmata. They remain side 
by side during the first meiotic division, and though they are already 
split at pachytene (pl. I, fig. 1) they are all at the same pole at first 
anaphase and do not divide. During interphase their pairing lapses 
(pl. I, fig. 8), though they remain condensed, and at the second division 
they come on to the plate separately and divide mitotically. The 
X’s are visible in the resting nuclei of half the newly formed spermatids. 
During the first division in the three-X species each X is always 
straight and in such a position that it is paired to both the others. 
In T. atrica, however, the X’s are sometimes relationally coiled at 
pachytene. 

In. all three species the X’s show properties characteristic of 
heterochromatin. They are often sticky, and the relational coiling 
that they show in T. atrica may be due to their sticking together. By 
the time of spindle formation they are undercharged, and it is in this 
condition that differences in their length can be most easily seen. 
Reports on other genera have stated that the X’s are of exactly equal 
size, but this is not the case in Tegenaria. In lacmoid squashes of 
T. atrica cells at metaphase they are 7 and 8 yp long, and in T. derhamii 
and J. domestica between. 5 and 6y. At interphase and in the 
spermatids they are super-condensed. 


(ii) Polarisation in the nucleus 


Polarisation plays an important part in meiosis in Tegenaria. I 
shall now relate the data already given to the changes that occur in 
chromosome orientation, summarised in the text fig. 

The centrosomes cannot be seen at the time of the concentration 
of chromosome ends prior to zygotene. At pachytene they are clearly 
visible, lying in a line at right angles to the nucleus in what we may 
call proximal and distal positions (pl. II, fig. 9). Each chromosome 
association is polarised with one of its ends towards them. The 
attraction is preferential, and always gives the same polarisation 
pattern, in which the X’s are in a central position nearest the centro- 
somes (pl. I, fig. 1). The only variation observed was the occasional 
failure of one or two of the autosome bivalents to polarise. 

Polarisation does not lapse at diplotene as is usual in animal 
meiosis but, instead, continues throughout the rest of prophase. As 
the bivalents begin to condense a space is left on the side of the nucleus 
opposite to the centrosomes, and here the occasional unpolarised 
bivalents can be seen, pushed as it were against the nuclear membrane. 

At about the end of diplotene the centrosomes begin to move apart ; 
each continues to act as a polarisation centre and the chromosomes 
therefore separate into two groups. These groups are unequal ; for, 
while the proximal centrosome always stays near the nucleus, the 
Y 
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distal one makes a wide sweep at first, skirting the cell membrane 
before it returns to a closer course again (pl. II, fig. 10). It always 
happens, therefore, that the proximal centrosome retains the X 
complement and from eleven to fourteen of the bivalents, while the 
distal centrosome has only six to nine bivalents and never the X’s 
(pl. I, figs. 3 and 4). 

By early diakinesis, before centrosome movement is much 
advanced, it can be seen that each paired X is polarised by its centro- 
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Fic.—Chromosome polarisation at the first meiotic division in Tegenaria. 


mere end. This is also true of most of the bivalents of any comple- 
ment, and no case was seen in which it was the distal chromosome 
ends which were demonstrably polarised. It is likely, therefore, that 
it is always the heterochromatin near the centromere that is attracted 
to the centrosomes. 

The two groups follow their respective centrosomes round, and by 
late diakinesis they are on nearly opposite sides of the nucleus, held 
closely against the nuclear membrane (pl. II, figs. 11 and 12). Spindle 
formation follows immediately upon this stage ; there is no typical 
diakinesis, though occasional nuclei were seen in T. domestica in which 
polarisation was relaxed sufficiently to give an almost normal dispersal 
of bivalents. 


(iii) Polarisation on the spindle 


After spindle formation chromosome polarisation is still completely 
maintained for a time, delaying metaphase co-orientation of the 
centromeres. In contrast to diakinesis the chromosomes of each 
group are now ranged in a semicircle round their pole so that all 
bivalents are the same distance from the centrosomes. This distance 
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is much greater than that which separated the autosomes nearest 
the centrosomes from them during prophase (pl. II, cf. figs. 11 
and 14). Surface repulsion between the chromosomes is reduced, 
and in this respect they are in a condition comparable to 
prometaphase. In the sectioned material cells at this stage 
showed so striking an imitation of anaphase that they were at 
first wrongly seriated (cf. Wallace, 1909, fig. 26). In lacmoid 
squashes, however, there is no danger of confusion (cf. pl. I, 
fig. 4, with pl. II, figs. 13 and 14). I have called it the Polarised 
Metaphase. 

After this delay the bivalents react to the spindle. They do not, 
however, react together but in turn, and each bivalent reaches a 
metaphase condition at a different time. Thus the metaphase plate 
is assembled gradually, by a successive depletion of the polarised 
groups, each bivalent remaining at the same constant distance from 
its centrosome until its turn comes, when its centromeres first co- 
orientate axially and then undergo congression by movement into 
the equatorial plane. 

The progress of congression is clear and significant. The order 
in which the bivalents move down to metaphase from the poles appears 
to depend largely on the position of their single chiasma, and conse- 
quently on the distance separating their centromeres. The bivalents 
that co-orientate first are in general those with the most distal chias- 
mata, and these bivalents congress on or near the axis of the spindle 
(plate II, figs. 15 and 16). The last bivalents to react are those 
with an extremely proximal chiasma, in which the centromeres are 
very close together ; these bivalents move down the spindle in an 
outside position and congress on the edge of the plate (plate II, 
figs. 17 and 18). I have called any cell with polarised and metaphase 
bivalents co-existent on its spindle a Transitional Metaphase (cf. 
Wallace’s fig. 23). Transitional metaphases representative of all 
stages in the change are seen, though the process seems to slow down 
as full metaphase is approached. 

These pre-metaphase stages are not instantaneous conditions. 
They are stable and lasting. Polarised metaphase, transitional 
metaphase and full metaphase are seen in the proportions of about 
3:6:1, and it is quite clear that the spindle is fully formed from 
the first. The polarised metaphase must be a stable phase during 
which the chromosomes are at rest on the spindle, and the large 
number of transitional metaphases observed indicates that the change 
to full metaphase occurs only slowly. 

The X’s never congress equatorially on the first division spindle, 
and remain apparently polarised until late anaphase (pl. II, figs. 16-20). 
They are thus controlled throughout the division by the centrosome 
that was proximal at pachytene, and at telophase they are included 
in the same nucleus. During both polarised and transitional meta- 
phase they are further from the centrosomes than are the autosomes 
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(pl. I, figs. 4 and 5). They are occasionally separated, though always 
on the same side of the spindle. 

At telophase a non-disjoined bivalent is sometimes seen near one 
pole. This is probably due to a delay in congression. No irregularities 
were ever observed in the behaviour of the X’s. 

The second divisions show no polarisation. 





5. SEX DETERMINATION 


All reports agree that the multiple X chromosomes in spiders are 
wholly heterochromatic and of exactly the same length, and that, 
though they remain side by side during the first meiotic division, 
they never form chiasmata. White, in his review of multiple sex 
chromosomes (1941), suggests on this evidence that the X’s are 
completely homologous and that true pairing and crossing-over is 
only prevented by their being heterochromatic. In the Tegenaria 
species examined the two or three X’s are of different lengths. Since 
the autosome number is constant there has evidently been an increase 
in number within the X complex, probably from an XO system, 
but that they have differentiated is clearly shown by these differences 
in length. Such differences could not be maintained with regular 
pairing in the homozygous sex (to which there is no known exception) 
if the different X’s did not form separate bivalents. Nor could they 
form regular bivalents in the female unless they were qualitatively 
as well as quantitatively differentiated. The association of X’s in the 
male, on the other hand, can be accounted for by their strong 
polarisation to the same centrosome, combined perhaps with hetero- 
chromatic stickiness ; though homology may play a part, it is not 
necessary to account for their pairing. 

No meiotic divisions were obtained in females, but the six X’s in 
mitoses of a 7. domestica female show that sex is determined by an 
X,X_,X, : X,X,X,X,X,X, mechanism in T. domestica and T. derhamii, 
and in T. atrica probably by an X,X, : X,X,X,.X, mechanism. 


6. CONTINUOUS POLARISATION 
(i) The control of X segregation 


Chromosome attraction to the centrosomes is general in animal 
meiosis. Descriptions have been given by Gelei (1921), and especially 
by Janssens (1924) and Darlington (1936) in spermatogenesis in the 
Acridine, where it can be observed under very favourable conditions. 
Its typical features may be summarised as follows :— 

(a) The autosomes are polarised by one or both ends, and the 
ends affected are heterochromatic. 

(6) The X is not so strongly polarised as the autosomes and may 
be unpolarised. 
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(c) The primary polarisation pattern may be modified by 
polarisation of a different nature—stickiness of heterochromatin, 
particularly of the X. 

(d) Chromosome-centrosome polarisation lapses at the end of 
pachytene. Heterochromatic stickiness does not. 

The concentration of chromosome ends occurs at leptotene, and 
it is reasonable to suppose, as Gelei (1921) has suggested, that 
polarisation is usually an adaptation which favours regularity of 
pairing. Beyond that it does no more. 

Tegenaria differs in four ways: (i) the X’s are more strongly 
polarised than the autosomes, (ii) polarisation is prolonged until 
anaphase, (iii) the X’s are always strongly attracted to one centrosome 
while the other moves in such a way as to leave them together, (iv) 
heterochromatic stickiness of the autosome is much reduced and 
never modifies the primary pattern. Thus in Tegenaria it seems that 
polarisation is secondarily utilised to keep the X’s together and to 
ensure their correct segregation at anaphase, and that the special 
features are adaptations to this secondary function. 


(ii) Secondary effects of the mechanism 


The action of the centrosomes on the chromosomes is apparently 
indiscriminate, for the autosomes as well as the X’s remain polarised 
both in the nucleus and on the spindle. The consequences of this 
secondary effect and its implications will now be considered. 


In the nucleus 


Morse (1909) has described a type of polarisation in cockroaches 
that is prolonged until metaphase, and Suomalainen (1946) has 
shown that in these species the unusual appearance of the prophase 
bivalents is due to the centromere and chromosome repulsion not 
being sufficient to open out the loops between chiasmata. White 
(1938) and Hughes-Schrader (1943) have described what is probably 
the same condition in several mantid species, and here also polarisation 
lasts until metaphase. 

In a normal meiotic prophase centromere and chromosome 
repulsions are suddenly set up within bivalents at the onset of diplotene, 
when polarisation in most animals ceases. Now repulsions within 
bivalents must be correlated with those between bivalents, and it 
seems likely that it is the sudden increase in intra-bivalent repulsions 
which breaks up polarisation at this stage. If these repulsions are 
reduced, as in cochroaches and mantids, then polarisation continues. 
And in Tegenaria we must assume that, in order that the X-chromosomes 
may be kept together, the repulsions are so adjusted as not to be 
sufficient to give the dispersal characteristic of typical diplotene and 
diakinesis stages, and polarisation is therefore caused to continue. 
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What attraction is concerned in the polarisation of chromosome 
ends? In Tegenaria the chromosomes follow the ‘centrosomes when 
they move round the nuclear membrane. This has also been observed 
in other cases where polarisation lasts late into prophase—in cock- 
roaches (Morse, 1909), in mantids (White, 1938, and Hughes- 
Schrader, 1943), and in Anisolabis (Schrader, 1941). Evidently, then, 
the attraction is to the centrosomes. Moreover, Wickbom (1945) 
finds in Triturus that, following colchicine injections, there is a failure 
of chromosome polarisation during prophase in the primary spermato- 
cytes, and he interprets this failure as due to an effect of the treatment 
on the centrosome. What is it then in the chromosome end that is 
attracted to the centrosome? In Tegenaria as in other animals the 
polarised ends are heterochromatic at certain stages. Ribbands 
(1941) has suggested very plausibly that the proximity to the nuclear 
membrane of chromosome regions heavily charged with nucleic acid 
is due to the rapid synthesis of nucleotides by these regions. During 
polarisation the centrosome, which must itself be a nucleoprotein, is 
undergoing rapid growth and is organising the spindle. Chromosome 
polarisation is therefore perhaps brought about by the chemical 
co-operation of the bodies which appear to attract one another. 


On the spindle 


The rigidity of the polarising mechanism also introduces some 
unusual features into the behaviour of the autosomes on the spindle. 
Normally, when the meiotic spindle forms, the centromeres of each 
chromosome association show two characteristic reactions to the new 
medium—they co-orientate, and they undergo congression in: the 
equatorial plane (Darlington, 1937). These reactions are more or 
less coincident with spindle formation. The behaviour of the bivalents 
in Tegenaria at this stage, as we have seen, is remarkable in two 
respects. First, their reaction to the spindle is delayed and they 
remain for a time attracted to the centrosomes. Secondly, when they 
do react, they do not do so simultaneously but instead successively, 
and the full metaphase plate is assembled by degrees from the two 
polarised groups. 

A situation in which chromosomes are, or appear to be, attracted 
to the poles of the spindle merits examination. During prophase the 
polarising attraction has been strong enough to draw the bivalent 
ends against the nuclear membrane, close to the centrosomes. With 
the invasion of the nucleus by the spindle at the end of diakinesis, we 
find that, though polarisation continues, the bivalents take up positions 
on the spindle in which they are further from the centrosomes than 
before (pl. II, cf. figs. 11 and 14) ; evidently when the bivalent ends 
are in the orientated spindle medium they cannot approach the 
centrosomes so nearly as when they are outside it. Clearly the poles 
are repelling the centromeres within the meiotic spindle in Tegenaria 
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from the first, but during the polarised stage this repulsion must be 
in equilibrium with the attractive force between chromosome ends 
and centrosomes. 


7. THE TACTOID HYPOTHESIS 


What then is the nature of this equilibrium? Bernal (1940) has 
advanced a hypothesis to account for spindle formation and for the 
consequent chromosome reactions, based upon the assumption that 
the spindle is a natural tactoid. Tactoids are the characteristic 
spindle-shaped gels that appear under suitable conditions in sols such 
as those of tobacco mosaic virus. The shape of the tactoid is due to 
the modification of the parallel orientation of the axially arranged 
molecules on either side of the equatorial plane so that they converge 
upon the poles. It is only with long needle-shaped molecules that 
the necessary surface tension conditions exist at the poles which 
bring this modification about. Enclosures of unorientated medium 
in the tactoid take on a tactoid shape and are called negative tactoids. 
They tend to move into the equatorial plane and into the axis of the 
positive tactoid because they cause least distortion of the molecular 
orientation in this position. 

Bernal suggests that the centrosome first induces an orientation 
of the long molecules radiating from itself to give an aster. It then 
divides as a result of the instability of the system, and a tactoid is 
formed, which increases in size by the growth of its molecules and 
envelopes the nucleus as the centrosomes move round. The centro- 
meres (at least when active) are assumed to be able to cause local 
dissolution of the long molecules, and when they come into the spindle 
medium they produce small negative tactoids which move into the 
equatorial plane to give metaphase congression and orientation. 
Anaphase movement is brought about by the subsequent growth of 
each negative tactoid, and the movement apart of the two daughter 
centromeres within it. 

The birefringence shown by the living spindle (Schmidt, 1937) and 
its general tendency to contract transversely in hypertonic solutions 
(Belar, 1929) support the view that it is a tactoid. There is little 
direct evidence for the existence of the postulated negative tactoids. 
Now, however, indirect evidence is available. There are the observed 
effects of experimental variables such as cold and colchicine (Barber 
and Callan, 1943), and observations on naturally aberrant mechanisms 
such as those of the autosomes and sex chromosomes on the spindle in 
Tegenaria. 

On Bernal’s theory each bivalent should begin to react immediately 
the nucleus was penetrated by the orientated medium, owing to the 
polar repulsion of the negative tactoid formed by the centromere pair. 
But in Tegenaria the attraction between chromosome ends and centro- 
somes still seems to operate, since the chromosomes take up positions 
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on the spindle which are the points of equilibrium between these two 
forces. On the assumption that the repulsion of negative tactoids by 
the poles increases steeply towards the poles, the chromosomes might 
well be stabilised in the two semicircular groups around these poles. 

At transitional metaphase this equilibrium is disturbed by the 
individual reaction of each bivalent to the spindle. The order in 
which the bivalents move on to the plate is significant. Bivalents 
with terminal chiasmata tend to congress first. This means that 
movement is influenced by the distances between the pairs of centro- 
meres—that is, presumably, by the size of the negative tactoid formed 
by each bivalent. 

The relative uniformity of autosome size in Tegenaria is probably 
an adaptation to reduce these differences as much as possible. 

The behaviour of the X group may be compared with that of the 
type of univalent which never reaches the plate at all, and is included 
undivided in one of the telophase nuclei in the same way as the X- 
chromosome in the Acridine. It does not move on to the plate because, 
having a single undivided centromere, it has no negative tactoid. 

This interpretation is not offered as the only physical explanation 
of the facts. But it is clear that, from the conception we have of the 
cell spindle as a liquid crystal of tactoid type, it is possible to explain 
aberrant as well as normal types of cell division. 


SUMMARY 


1. Three species of Tegenaria have 20 pairs of autosomes. The sex 
chromosome number varies according to the species—2 or 3 X’s 
in the males and twice the number in the females. 

2. At meiosis in the males the X’s are strongly heterochromatic ; 
they do not form chiasmata but remain side by side, and at first 
telophase they are included in the same nucleus. The regular 
segregation of the X’s is ensured by their being held near the same 
centrosome throughout the first division. 

3. The autosomes are attracted to one centrosome or the other 
during prophase. Consequently there is no typical diplotene or 
diakinesis, and a stage resembling anaphase precedes metaphase. 
The autosomes later congress on the spindle, while the X’s do not. 
This continuous polarisation is considered to be due to a reduction 
or delay in the centromere and chromosome repulsions which normally 
disrupt polarisation at diplotene. 

4. The behaviour of the autosomes and X’s on the first meiotic 
spindle is shown to be describable in terms of the tactoid theory of 
spindle structure and the known cycles of centromere activity. 


I wish to thank Dr P. 'T. Thomas for advice during the preparation of this paper. 














Prate I,—Tegenaria atrica (except fig. 6, T. domestica) ; first division in the Sperm mother 
Acetic-alcohol, Lacmoid. x 1400. 









Fie. 2.—Diplotene, 





Fic. 1,—Pachytene. 


Fic, 3.—Diakinesis : 2 X’s in top Fic. 4.—Polarised metaphase : 2 X’s 





group. in top group. 
Fic. 5.—Transitional metaphase, Fic. 6.—Full metaphase with 3 X’s, 
Fic. 7.—Anaphase. Fic. 8,—Interphase: 2 X’s in top 


nucleus, 



















Prate II.—The same : 2BD, Crystal Violet. x 2600. 









Fic. 9.—Pachytene: proximal Fic. 10.—Diplotene: movement Fic. 11.— Diakinesis; 
and distal centrosomes. of the distal centrosome. . two polarised groups 
of chromosomes, 


Fic. 12.— Diakinesis: two Fics. 13 and 14.—Polarised metaphase. 
polarised groups of chromo- 
somes, 








Figs. 15-17.—Transitional metaphase : stages in the ormation of the metaphase, 





Fic. 18.—Transitional meta- Fic. 19.—Full metaphase, Fic. 20,—Anaphase. 
phase : the last stage, 























CONTROLLED X-SEGREGATION IN TEGENARIA 347 


REFERENCES 
BARBER, H. N., and CALLAN, H. G. 1943. MORSE, M. 1909. 
Proc. roy. Soc. 131, 258. Arch. Zellforsch. 3, 483. 
BELAR, K. 1929. PAINTER, T. 8S. 1914. 
Arch, Entw. Org. 118, 359. Zool. Fahrb. 38, 509. 
BERNAL, J.D. 1940. 7 i 
The Cell and Protoplasm. a. 


. Genet. 41, 411. 
Am. Ass. Adv. Sci. Symposium. oo ee 


6 SCHMIDT, W. J. 1937. 

ny Tap - - Die Doppelbrechung von Karyoplasma, & yto- 
J. Genet. 33; 485. plasma und Metaplasma. Berlin. 
DARLINGTON, C.D. 1937. ieee. sae 

Recent Advances in Cytology. London. 3. Morph. 68, 123. 

DARLINGTON, C. D., and LA COUR, L. F. 


1942. SUOMALAINEN, E. 1946. 

The Handling of Chromosomes. London. Ann. Acad. Sc. Fenn. A. IV. 10, 1. 
GELEI, J. 1921. WALLACE, L. B. 1909. 

Arch. Zellforsch. 16, 88. Biol. Bull. 17, 120. 

HARD, W. L. 1939. WHITE, M. J. D. 1938. 

JF. Morph. 65, 121. Proc. roy. Soc. 125, 516. 
HUGHES-SCHRADER, S. 1944. WHITE, M. J. D. 1940. 

Biol. Bull. 85, 265. F. Genet. 42, 143. 

JANSSENS, F. A. 1924. WICKBOM, T. 1945. 

La Cellule 34, 135. Hereditas 31, 241. 
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IN 1945, in Line Thirteen of the series of inbred lines set up by 
Professor R. A. Fisher in the house mouse, Mus musculus, a disturbed 
segregation of shaker (sh,) and sex was noticed. Data collected from 
three unrelated lines, Eight, Twelve and Eighteen, in which wavy 
(wv,) and shaker (sh,) were segregating, indicated linkage between 
these two factors and sex. G. D. Snell and L. W. Law (1939) had 
published data from which a crossover value of 25 per cent. between 
wavy and shaker had been obtained. An experiment was therefore 
set up, designed to investigate their linkage relations not only with 
each other but also with sex. 

Appropriate genotypes were taken from Lines Twelve and Eighteen 
and a separate stock of wavy and shaker; crosses were made to 
produce each of the four kinds of males needed. 





(i) Tricoupling 3 wa Sia 
(ii) wv, and sh, in coupling 3 rae 
(iii) sh, and sex in coupling e) ay 
(iv) wv, and sex in coupling 3 ene 


In order that any causes other than linkage, likely to disturb the 
ratios obtained, should be distributed at random throughout the 
matings, mates were chosen during the following generations without 
regard to their consanguinity. Approximately a hundred mice were 
bred from each type of triple backcross (comprising twenty-four 
matings). The results are tabulated by phenotypes on page 350 
(table 1). 

In a form more convenient for considering linkage relations, these 
data may be tabulated as under, the columns representing respectively : 


I. Parental types. 
II. Recombinants, with crossovers interchanging X and Y. 
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III. Recombinants, with crossovers interchanging wv, and its 
normal allelomorph. 

IV. Recombinants, with crossovers interchanging sh, and its 
normal allelomorph. 


TABLE 2 
Observed (m-+x) 





Recombinants 





Paternal types | I II | III|IV| Totals 


— | X | weg! shy 

















(i) Tricoupling , ‘ : ‘ oath 23 | 44] 18] a1 106 
(ii) wvg—shg in coupling . Gites 36 | 55/17] 12 120 
(iii) shg—sex in coupling . sine 39 | 46] 19 | 23 127 
(iv) wvg—sex in coupling . 3 a 29} 40] 15 | 16 100 

Totals . - | 127 | 185 | 69 | 72 453 





























It will be seen that this table involves the grouping of data 
concerning pairs of complementary genotypes from each kind of 
mating, these pairs being arranged as are the letters in a symmetrical 
4x4 Latin Square. If there are any viability differences they may 
normally be expected to show most plainly between the members of 
these pairs. A x? test on their approximation to 1:1 ratios will 
indicate if any of these pairs of genotypes are unequally viable 
(table 3). . 











TABLE 3 
Pairs of complementary Totals 2 
genotypes x 
wish, 9 61) ; 
+ +4 52] 113 o-7168 
++ 2 64 ; 
wissh, 3 4 - 08475 
Sh, Q 51 5 
wy +3 38} og | oad 
wig + 2 51 F 
had Ba} 113 ics 
453 3°0846 ; n=4 

















The low x? indicates that there are no obvious viability disturbances. 
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A further and more comprehensive test, to discern whether 
viability or any other factors are disturbing the data, may be made 
by comparing the observed frequencies with those expected on the 
basis of the marginal totals in table 2 (table 4) : 












































TABLE 4 
(a) Expected (m) 
I II III IV 
(i) 29°71 74 43°2892 16°1457 16:8477 106-0000 
(1i) 33°6424 49°0066 18-2781 19'0729 120°0000 
(iii) 35°6049 51°8653 19°3 20°1854 1270000 |, 
(iv) 28-0353 40°8389 15°23! 158940 100:0000 
127°0000 1850000 69-0000 72*0000 453°0000 
(b) (x?/m) 
I II III IV Total x? 
(i) 1+5184 O’O117 0°2130 10234 
(1i) 0°1652 0°7330 0:0894. 2°6229 
(iii) 0°3237 06633 0-0061 0°3925 
(iv) 0°0332 O°0172 0°0035 0:0071 
2°0405 1*4252 0°3120 4°0459 7°8236 (n=9) 


























The x? is again insignificant for these nine degrees of freedom. 

Finally, any inequality due to viability or other factors between 
the four pairs of genotypes in table 3, which might have disturbed the 
linkage relations, will become evident by comparison with the 
expected frequencies of these genotypes ; these can be drawn from 
table 4a and appear below, together with a x? test comparing them 
(table 5) : 











TABLE 5 

Genotypes Expected (m) Observed (m+ x) x*/m 
wish, 292 and + + gd 113°9624 113 0:00813 
+ + QQand wogsh, oo a 118 0°28426 
+ she 29 and wo,+dd 1116624 109 0:06348 
we + 9Qand + siadd 1150264 113 0'03570 
453°0000 453 039157 

(n=3 




















The x? is extremely low, and contributes very little to the total 
x? of 7-8236 for nine degrees of freedom (in table 45). 

It appears, therefore, that viability and other disturbances are 
negligible. 
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It remains to test the hypothesis of independent segregation of 
sex with shaker and wavy. A x? test on the approximation to a 
1:1 ratio of the parental types of offspring against the recombinants 
involving an interchange of X and Y is given below : 





Parental types | Recombinants x? 





127 185 10°782 (n=1) 








The value of x? for one degree of freedom is equivalent to a normal 
deviate of 3°:284 standard deviation; the one in 1000 level of 
significance requires 3:241. The hypothesis of independence is 
therefore untenable. 

It is reasonable, therefore, to conclude that the factors wavy and 
shaker are linked, not only with each other, but also with sex. The 
recombination values derived from table 2 are (table 6) : 




















TABLE 6 
: ; : Percentage Standard 
Linked loci Recombinants | Total antenehiaiiies poser 
wv, and sh, 69+72=141 453 31°06 per cent. | 2°174 per cent. 
wv, and sex 185-+69=254 453 56-07 ‘ia 2°332 = 
sh, and sex 185+72=257 | 453 | 56°73 5 2328 ly, 











The recombination value between shaker and wavy is higher 
than that found by Snell and Law (1939). However, their estimate 
was based principally on female gametogenesis, and that given above 
is based on male gametogenesis ; it is not impossible, therefore, that 
they should differ. For males, Snell and Law report 15 recombinants 
out of 56, an observation not incompatible with a recombination value 
exceeding 30 per cent. 

The two main conclusions to be drawn from table 6 are: first, 
that the recombination values between shaker and sex and between 
wavy and sex are both significantly greater than 50 per cent. ; and, 
secondly, that the two loci, wavy and shaker, recombination between 
which is as great as 31 per cent., are nevertheless similarly related as 
regards their values with sex. 

Explanation of these phenomena is outside the scope of this paper, 
but it is submitted that while these conclusions appear mutually 
repugnant, the data given above supply very strong evidence for each 
of them. 
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I am indebted to Professor R. A. Fisher for his guidance and encouragement 
and for the use of his stocks ; and also to Mr T. C. Carter and Miss M. F. I. Speyer 
for their help in completing the experiment. 
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1. THE FORMULATION OF LINKAGE RELATIONSHIPS 


Since Morgan’s’ original proposal in 1911 of a linear arrangement 
for genes in the same chromosome, it has been obvious that it should 
be possible to specify, in terms of the lengths, in some appropriate 
metric, of the intercepts between them, the entire series of gametic 
frequencies for an organism heterozygous at any number of linked 
loci. The lack of such a formal specification constitutes the most 
serious gap in current expositions of genetical theory. The following 
attempt at such a formulation, stimulated by the experimental data 
recently obtained in this department on the sex chromosome of the 
house mouse (Wright,* 1947), owes much to earlier writers, among 
whom especially should be mentioned K. Mather? (1937) and 
Kosambi? (1944), both of whom have supplied ideas essential to 
what follows. The comparisons of the theoretical expectations with 
the relevant observations in other cases is a task beyond the scope of 
the present paper. 

Any segment of a chromosome bounded by two marked loci, 
which enters a gamete, will have experienced in the preceding meiosis 
0, 1, 2,. . . breaks involving interchange with one or other strand 
from the homologous chromosome. These occurrences are mutually 
exclusive and exhaustive, so that if fp, ~,, po, . .. represent the 
frequencies of these events, it follows that 


fPothithet ..- =1. 


With these quantities we may associate others, s,, representing 
the probabilities of r or more interchanges, so that 


We may recognise in s, the probability integral of the position of 
the r“ break from the first locus, the integral being evaluated at the 
position of the second locus. The values s, like p, are always positive, 
and each value is less than the last by the deduction of the corresponding 
p, ie., 

Sp—Sp41 = py 
355 Z 
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The observable recombination fraction between the two loci, 
denoted by », is seen to be 


J=fit Pst Pst.---, 


= $y—Sq+53—Sg +55 — ee 8s 


while the map distance, representing the average number of inter- 
changes in the segment chosen is 


X= pPyt2pot3pat4pat - +--+; 
= 5+ Set Sgt Sat or ee aa 


The latter quantity is additive, since if B lies between A and C, 
the average number of interchanges between A and C must be the 
sum of the average numbers between A and B and between B and C. 
Consequently x, though not directly observable, supplies a consistent 
metric for mapping. 

Kosambi has shown that, to a very satisfactory approximation in 
many cases, the two genetically important quantities x and y are 
connected by the relation, 


tanh (2x) = 2». 


This relation suffices to specify the gametic series of heterozygotes 
at three loci, but not at four or more, for which we should need 
expressions for the entire series 5,, 59, 53, ... in terms of some 
common parameter, in terms of which, therefore, both x and » could 
be specified. 

Owing to the probability of disturbances in the neighbourhood 
of the centromere, and of the ends of the chromosome, we shall not 
assume the exactitude of Kosambi’s relationship, but instead shall 
introduce a metric u, in terms of which interference of neighbouring 
interchanges shall be uniform, and in particular for which the 
probability of an interchange is reduced in the ratio 


tanh (47u) (1) 
by the influence of an interchange already established at a distance u. 
Since 
[tanh gag 
is log cosh ¢, 
and ele cosh ¢ 
is sech ¢, 


it follows that the probability that the next interchange to the one 
already established lies in the intercept du is 


sech ($u) tanh (4$u) d(4au) 
= d(—sech }nu). (2) 


This expression therefore defines the frequency distribution of the 
length of intercept between two adjacent interchanges. The theory 
we shall develop postulates that this distribution is independent of all 
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more remote breaks, so that the lengths of adjacent intercepts are 


distributed independently. 
The average value of the intercept length, u, found by evaluating 


[ua — sech 4m) 


is unity, since by partial integration this expression is equal to 


j sech }mu du, 
eee 
or to fee aw) 2? 
this becomes {5 4{tan- z| = I, 
I +22 7 


on making the readout 
Z = eis, 


It is for this reason that the argument 4zu has been used, rather than 
any other multiple of u ; for in regions far from either end of a chromo- 





RELATIVE FREQUENCY 
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Fic. 1.—Frequency distribution of intercept length sech ($7u) tanh ($2ru) d($2u) adopted as 
basis for calculation. 


some arm, this convention leads us to expect the average density of 
breaks to be unity, so that u will closely simulate the map distance in 
such regions. The distribution of intercept length is shown in fig. 1. 
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2. THE PAIRING SEGMENT OF THE SEX CHROMOSOME 


Let us now consider the special case afforded by the pairing 
segment of the sex chromosome in an organism such that (as Koller ® 
and Darlington (1934) have shown in Rattus norwegicus) chiasmata may 
occur between the centromere and the sex-determining portion. In 
the male no chiasmata can occur in the differential segment. Conse- 
quently, the repressive influence of neighbouring breaks will be 
reduced at the adjacent end of the pairing segment, and qualitatively 
we might expect the density of breaks to be raised in this region. 

Arguing more exactly, we may say that the distribution of the 
first break in the pairing segment will be that of a value u which is 
constrained to exceed some arbitrarily large value U. In other words, 
the distribution will be that of the limiting form of the tail of the 


original distribution, or 
df = e-*™ d(}mu), 


where u, is now measured from the end of the pairing segment. 
The probability that there will be at least one break within a 
distance u from the end of the pairing section is now given by 


1p = 5, = [e-™ d( dry) 


so that fo =e ™ 
5, = 1—e7*™, 
The probability of exactly one break may be evaluated by 
integrating, for values of u, less than u, the expression 
e~#™ d(}mu,) sech $n(u—u)), 
which is the product of the probability of a first break in the interval 
du, by the probability that there shall be no further break within the 


length u from the terminus. 
This probability is, therefore, 


pi = fort sech bo(u—uy)d(dmu). 


By making the successive substitutions 
v = 4n(u—u,), Zz =e’, 
its value is found to be 
dau 
2z dz 
ee 
ll 


= ¢~IIJog | 


hare e 
e~ im | e’sech v dv = mf 
° i I 
= Iog (1424 
so that py = e~*™f{log cosh $2u+4}nu} 
and Sg = Sy—py = 1—e *™{log cosh }2u+4nu+ 1}. 
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For breaks subsequent to the second the analytic forms are 
difficult, and recourse may be had to mechanical integration. 
Characteristic values of s for the distribution of the first few breaks 
are given in table 1. Fig. 2 shows the corresponding frequency 
distributions, 


TABLE 1 
Probabilities of at least 1, 2, 3, . . . breaks between given points and the sex-determining 


segment, with corresponding values of the map distance (x), and the recombination 
Sraction (y). 






































Distance in 
metric chosen Sy Sq Ss Sq Ss Sg | * per cent.|y per cent. 
u 
000 ° ° ty) fr) ve) oO te) te) 
} or10 "14536 62 14°598 | 14°474 
0°20 +26960 467 3 27°430 | 26-496 
| 0°30 °37577 1472 20 39°069 | 36-125 
| 040 "46651 3228 76 I 49°955 | 43°497 
| 
|}  0*§0 "54406 5780 209 + 60-399 | 48°831 
0-60 61034 go86 469 12 70°601 52°405 
| 0°70 66698 *13044 gi2 32 I 80°687 | 54°535 
0-80 *71539 *17520 1594 72 2 99°727 | 55°734 
0-90 "75076 = 22364 = 2504147 5 100°756 | 55°734 
1:00 ‘79212, 27434 3859 2730S 110°789 | 55°275 
I'10 82234 32601 5502 470 24 I 120°832 54°688 
1+20 ‘84816 = 37754 «= 7503. 763) 46 2 | «130°886 | 53-846 
| ; 
16 
44 
J 
‘4 * 
*6 
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2 ., 
° 
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METRICAL DISTANCE (u) 


Fic. 2.—Distributions of the first four breaks, counting from the junction of the differential 
and pairing segments, in terms of the metrical distance (u) from this junction. 
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The sum of the frequency ordinates for these. successive breaks 
gives the total density of interchange in relation to the metric 
u adopted. It will be seen that this density is slightly enhanced in 
the terminal region, but that it soon becomes almost uniform, so 
that increases of u become very nearly equivalent to equal increases 
in map distance. 

We are now in a position to compare the map distance measured 
from the sex-determining portion of the chromosome, with the 
recombination fraction observed between a given marker gene and 
sex. The comparison is shown in fig. 3. It will be seen that the 
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Fic. 3.—Relation between observable recombination (y) with sex and map distance (x) 
from the differential segment. 


recombination fraction, rising initially in equality with the map 
distance, attains a maximum of about 55°75 per cent. at a map 
distance of 98 or 99 centimorgans, and thereafter falls gradually 
towards 50 per cent. For a considerable distance of more than 30 
centimorgans the recombination fraction exceeds 55 per cent., and 
can be detected by reasonably careful experimentation. At positions 
from about 58 to 68 centimorgans from the terminus the difference 
from 50 per cent. recombination will be almost imperceptible, and 
genes in this region will appear to be inherited almost independently 
from sex, as will be the case also at distances much greater than we 
have explored. 

For lengths much greater than our figures illustrate, it is to be 
presumed that the calculations will become unrealistic, owing to the 
unknown position of the centromere, which in these figures has been 
taken to be infinitely distant. They should, therefore, be regarded as 
provisional and subject to revision in the light of a more exact 
examination. Qualitatively, however, we should expect them to 
give a good representation of the genetically observable relations of 
sex-linked genes. 
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3. COMPARISON WITH THE LINKAGE RELATIONSHIPS OF THREE 
GENES IN THE HOUSE MOUSE 


During 1945 Wright, working in this department, observed in 
several lines carrying wv, or sh, a linkage-like disturbance of the 
sex ratio, and proceeded to set up the definitive test reported by 
her in this number.‘ In the following year Falconer observed in 
Line 9 of these stocks a rather close linkage between the dominant 
waving gene Rex (Re) and sh. Since Re and wv, have similar effects 
on the hair, they could not be tested in the same line, but a few special 
matings, which Falconer > has reported, were sufficient to show that 
Rex and wavy were not allelomorphic, but on the contrary that they 
are located on opposite sides of the shaker locus. 

Snell and Law (1939) who first reported ® the linkage of wv, and 
Sha, give the recombination fraction as 25 per cent. in females, there 
being 64 recombinants among 256 young. For males they report 
only 56 young, and do not state whether their doubly heterozygous 
parents received the mutant or the normal genes from their fathers. 
These data cannot therefore be used for examining sex linkage, but 
give an estimate of 26-8 per cent. for recombination between wz, 
and sh, in male gametogenesis. We are indebted to Dr Snell, also, 
for some early intercross data (table 2) in which the males are 
known to have received both mutant genes from the dam. 


TABLE 2 





| Per cent. of expectation 





++ Wg shy wgsh, Total 


WU, sh 
110 13 19 12 154 64:9 80°5 
3 117 22 16 16 171 88-9 74°9 
































ORR es <a 227 35 35 28 325 





So far as these go they give maximum likelihood estimates of 
56-37 per cent. for the recombination of wavy and sex, in confirmation 
of Wright’s value of 56-07 per cent., but of 47-65 per cent. for sex and 
shaker, which is significantly lower than Wright’s value, 56-73 per cent. 
Apart, however, from being intercross data in which accurate allowance 
for differential viability is not practicable, it should be noticed that 
both recessives are seriously below expectation in frequency. The 
corresponding estimates by the product method are 57-24 per cent. 
for wavy and sex, and 48-30 per cent. for shaker and sex. Even the 
product method, however, can scarcely in this case eliminate the 
viability disturbance, since the deficiency may not improbably be 
unequal in the two series. Indeed, the average recombination found 
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here, 52°77 per cent., might be interpreted to exceed 50 per cent. 
only by reason of a greater elimination of recessive females. On the 
whole it seems best to use Wright’s values, without attempting to 
adjust them by the use of these earlier observations, 

Somewhat clearer confirmation is shown by some more recent 
backcross data, which Dr Snell has since kindly added. These are 
shown in table 2a. 

TABLE 2a 


wigshyX ++/Y 

















x 
wugshyX wushyX 3 
++ WV, shy wvsh, Total 
27 8 13 12 60 
3 14 5 7 15 41 
Total ° ° ° e 41 13 20 27 101 















































++ WV shy wuashs Total 
13 I 5 10 25 
3 9 I 3 4 17 
Total . : ; : $ 22 2 8 10 42 

















There is again some abnormality in the single factor ratios, but so far 
as recombination is concerned these mice, 143 in all, give estimates 
very close to those of Wright’s experiment. For linkage of wavy 
with sex we have here 79/143, or 55°245 per cent., for shaker with 
sex 78/143, or 54°545 per cent., and for wavy with shaker 43/143, 
or 30°070 per cent., all in close agreement with the values found 
in our own department. 

The paradoxical nature of Wright’s data consists in two points : 
(i) the appearance of recombination fractions between sex and wavy 
and between sex and shaker both significantly exceeding 50 per cent. ; 
(ii) the fact that two loci distant from each other by about 35 units 
of map distance should be almost equally related, as regards 
recombination, with sex. With three-point backcross data we are 
accustomed to recognise the order of the genes by the doubly 
recombinant class being distinctly rarer than all others. In these 
data, however, the striking and most significant difference lies between 
the old combinations, and those in which both wavy and shaker are 
separated from sex. 

These remarkable features are perfectly in accordance with 
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expectations based on fig. 3. We must suppose that both the loci for 
wv, and for shg lie near the maximum, probably on either side of it. 
There is apparently plenty of room in this region for loci 35 or 40 
units apart having recombination fractions insignificantly different 
from those observed. The data do not by themselves suffice to 
determine which of the two is nearer to the sex-determining region 
of the chromosome. 

The order is, however, rendered very probable by the evidence 
concerning Rex. The locus of Rex is estimated, on the small body 
of data available to Falconer, to be about 20 units from shaker, on 
the side distant from wavy. A summary of progenies from Rex males 
observed in this department gives to date the following frequencies 
(table 3). 








TABLE 3 
+9 +3 ReQ Reg | Total 
Coupling ; ; ; 119 108 116 120 463 
Repulsion ‘ . ; 64 | 59 46 | 57 226 








The ratios of recombinants to old combinations are therefore 
224 : 239 in coupling, and 121 : 105 in repulsion. It is obvious that 
recombination does not differ significantly from 50 per cent., and this 
indicates that the locus of Rex must be about 64 units from the end 
of the map. A more careful estimate, eliminating the possibility of 
small differences between the relative viabilities of Rex animals in 
the two cases, is supplied by the equation 


Pan 
I—y 239 105 


from which it appears that y, estimated from these data, is about 
50°96 per cent. 

It is therefore easy to assign positions to these three loci consistent 
with the whole of the genetical information so far available about 
them. With more data a closer tie-up should be possible, and it 
may even be necessary to consider such observable effects as the 
centromere of the sex chromosome may possibly have on these 
relationships. Triple backcrosses of males with wv, and sh, females 
from independent stock, and with sh, and Re are already in hand. 
As a provisional map, however, we propose 


Map distance 
Locus from terminus 
Re 65 cm. 
shy 85 ” 
WU 120 5, 


It ‘is still possible, however, that these three loci may lie in the 
reverse order with Rex some way beyond the centromere. 
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DISCUSSION 


The genetic phenomena of the sex chromosome in mice display 
the two surprising features, (i) of recombination fractions significantly 
exceeding 50 per cent., and (ii) of two loci, those of wavy and 
shaker, separated by a considerable interval of about 35 map units, 
showing no apparent difference in their linkage with a third locus, 
namely, that of sex, to be identified with the end of the pairing 
segment. Even though these phenomena are in harmony with the 
theory of interference which we have put forward, and which contains 
no strikingly novel feature, it may well be asked why such phenomena 
have not been observed in other organisms. 

If the basis of our calculations is at least approximately valid, 
the conditions for the observation of recombination fractions exceeding 
50 per cent. appear to be (a) the use of a marker gene located near 
the end of the chromosome, and (b) that the chromosome arm on 
which it is situated should be of considerable map length. The first 
of these conditions is admirably met by using sex itself as a marker, 
and probably is rarely satisfied by the location of autosomal genes. 

The other organisms in which the pairing segment of the sex 
chromosome can be studied do not seem to provide the conditions for 
finding a similar genetic situation. In Lebistes reticulatus, where 
partial sex linkage, or inheritance in the pairing segment, was first 
demonstrated, the known genes, though numerous, are all very 
closely linked with the sex-determining portion. They are, moreover, 
functionally associated with sex, being epigamic in character and 
sex-limited in manifestation to the male. Only if numerous genes 
of other kinds were available in this species could it be determined 
whether any were loosely linked with sex and, if so, whether in any 
such case the recombination fraction exceeded 50 per cent. 

In Man the conditions for the detection of incomplete sex linkage 
are such that only relatively close linkage, up to about go per cent., 
can be recognised, so that, if any part of the sex chromosome were to 
show recombination with sex exceeding 50 per cent., but less than 
70 per cent., it would, with the kind of data hitherto available, or 
likely to be available in the future, certainly appear to be inherited 
independently of sex. 

In Drosophila both X-borne and autosomal genes are known 
located close to the ends of their respective chromosome arms, and 
these arms are not all so short genetically as to preclude absolutely 
recombination between the gene and the centromere slightly exceeding 
50 per cent., though there is not much room for this. The bulk of 
genetic tests carried out with this genus, with a view to accurate 
mapping, has, however, for obvious reasons, been principally con- 
cerned with determining the shorter map intervals with precision 
with a view to inferring the longer map intervals from these by 
addition. Whether a search of the published data for large-scale 
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tests with longer intervals would yield any situation comparable to 
that which appears to prevail in the sex chromosome of the House 
Mouse we do not know, although it will clearly be profitable to 
try out our formulation of linkage relationships on the best of this 
material, and on other species where, with fewer genes observable, 
longer intervals have often been determined with care. 

If our views are correct, the situation in the House Mouse, though 
qualitatively similar to that revealed by Koller and Darlington in the 
Rat, must differ quantitatively from that species in that the arm 
of the pairing segment bearing the sex-determining tract must, so 
far as we can judge, extend to some 80 units of map distance in the 
House Mouse ; whereas the frequency of equational separation of the 
centromeres observed by Koller and Darlington suggest that the 
distance in the Rat is not more than 5 to 10 map units. Further 
cytological studies of the Mouse will therefore be awaited with great 
interest. 
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l. INTRODUCTION 


Wricut (1947) has reported the occurrence of a new hereditary 
character in the house mouse (Mus musculus, L.) ; the affected mouse 
typically shows soft undulations of the tail and a tendency to a 
hunchback. She concluded that the inheritance of the character is 
unifactorial and recessive, and suggested the name “ undulated ” 
and the symbol un for the mutant gene concerned. 

The present paper gives data which indicate that un is closely 
linked to the agouti locus ; it therefore belongs to Linkage Group V 
(Dunn, Griineberg and Snell, 1940). No data are yet available on 
its segregations with the other mutants in this group, namely pallid 
(Roberts and Quisenberry, 1935), wellhaarig and kreisler (Hertwig, 
1942), nor with hydrocephalus-1 (Clark, 1936). 


2. MATERIAL 


The greater part of the material to which the data refer consists 
of four lines which are being set up in the Department of Genetics, 
Cambridge University. With a view to an extensive test of possible 
linkages of the new factor, undulated was introduced into these lines 
by Miss Margaret Wright during 1944 and 1945. It was intended 
to test it in a fifth line with a representative of Linkage Group V, 
namely wellhaarig ; but this was delayed as wellhaarig was not then 
available outside Germany. Meanwhile the other four lines, which 
were not intended to segregate at the agouti locus, continued to do 
so ; and during 1946 it became obvious that this continued segregation 
of the agouti alleles was associated with the segregation of undulated. 

The fact that the greater part of the data arose in this way explains 
why most of the matings concerned were intercrosses rather than 
backcrosses ; however, the linkage proved to be close, and the 
coupling intercrosses (which constitute the greater part of the data) 
are therefore almost as informative as coupling backcrosses. 

Four of the agouti alleles were involved, namely tan (q’), light- 
bellied agouti (A”) and non-agouti (a) as well as agouti itself (A). 
Tan was acting as a dominant (to non-agouti) in the repulsion 
matings and as a recessive (to light-bellied agouti or to agouti) in the 
coupling matings. 
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All the mice were normally classified as early -as possible, but the 
classifications were checked when the animals were between 18 and 
21 days old. 





3. DATA 


The basic data are given in table 1. This shows the number of 
progeny in each of the phenotypic classes from matings of five types : 


Coupling intercrosses with agouti A Un/a‘un 9 xA Un/a‘un 3g 


segregating 

Coupling intercrosses with light-bellied A”Un/a'unQ x A” Un/a'un 3 
agouti 

Coupling backcrosses with segregation in A”Un/a‘un Q x a‘un/a‘un 3 
the female 

Coupling backcrosses with segregation in a‘un/a'un 9 x A”Un/a‘un 3 
the male 

Repulsion intercrosses_ . , cf . @Un/ad'un 2 x aUn/a'un 3 


In one of these bodies of data, namely the coupling intercrosses 
in which agouti and tan were segregating, a more complete classifica- 
tion is possible, by virtue of the incomplete dominance of agouti 
over tan ; details of this classification are not, however, given. 

The data are unselected, save in the following respects :— 


(i) Any animal which died before reaching the age of 14 days 
has been excluded. It was found that some animals which 
appeared at birth to have undulated tails later became fully 
normal ; acceptance of an unchecked early classification 
might therefore have led to biassed results. The age of 
14. days was chosen because this was considered to be the 
earliest age at which a sure distinction could be made 
between undulated and normal animals. 

(ii) Albinism was segregating in one of the lines : albinos, which 
could not be classified for the agouti phenotype, were 
excluded. 

(iii) Fused was segregating in a few matings in one line. While 
there was no difficulty in classifying the fused progeny as 
such, difficulty was encountered in deciding whether these 
mice were also undulated or not; they were therefore 
excluded. 


4. ANALYSIS OF THE DATA 


On inspection, the data give an impression of good unifactorial 
segregations of both of the loci concerned (undulated and agouti), 
but of close linkage between them. This impression is confirmed by 
statistical analysis, an outline of which is given in table 2 :— 


(i) Segregation at the undulated locus. All five bodies of data show 
good unifactorial segregation at this locus. 
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(ii) Segregation at the agouti locus. The agreement between 
observation and expectation for the segregation at this 
locus is extremely good. As the data were unselected, 
except as indicated above, this must be attributed to chance. 















































TABLE 1 
Phenotypes of progeny 
Mating type Mating no. a 
AUn A un atUn atun 
Coupling intercross . V/24 2 ane 3 
V/25 14 I 2 
V/28 6 z 2 
V/29 4 2 
V/30 6 + 
Totals 32 I 13 
AY“Un A”un a'Un atun 
Coupling intercross . *6/17 12 4 
on *6/18 4 I 
6/30 36 2 13 
6/51 40 I 7 
*8/13 32 I I 12 
*8/15 35 2 I 10 
*8/16 3 
8/34 17 9 
Totals 179 4 4 56 
Coupling backcross ; 6/48 5 2 
female segregation 6/56 7 2 8 
8/42 I I 
Totals 13 2 II 
Coupling backcross ; 6/53 7 II 
male segregation 8/37 9 I 8 
VII/29 Ae I 
VII/30 7 I I I 
Totals. 23 I 2 21 
Repulsion intercross a'Un atun a Un aun 
*2/10 6 4 4 
“2/16 13 7 4 
2/23 7 7 2 
2/24 18 9 II 
*21/8 21 II 10 
21/21 15 15 4 
21/27 20 6 13 
V/18 y 2 4 
Totals . 107 61 53 























The data given by Wright (1947) include details of some or all of the progeny of the 


matings here marked with 


an asterisk. 


(iii) Method of analysis of the linkage. The “scoring method ” 


developed 


by Fisher (1946), which is a variant of his 
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** maximum likelihood’ technique, has been adopted ; 
its advantages include the comparative ease with which 
a joint test of significance and a joint estimate of the 
recombination fraction are obtainable from the five bodies 
of data. 

(iv) Significance. The x? testing independence of segregation, 
which has one degree of freedom, is 


x? = 288-7. 


The corresponding probability is extremely small; a 
hypothesis of independent segregation is therefore untenable. 

(v) Intensity of linkage. The joint estimate obtained for the 
recombination fraction is 


p = 3°86 per cent. 
and its standard error is calculated to be 


5 = 1°02 per cent. ; 


but Stevens (1942) has pointed out that it is incorrect to 
use a standard error to find fiducial limits in cases such 
as this where p is small compared with unity. The present 
results are not in a form suitable for the direct application 
of the correction tables he gives, and so the “ equivalent 
backcross data’ have been calculated ; i.e. a calculation 
has been made of the numbers of recombinant and non- 
recombinant progeny, which, if they had all been produced 
by backcross matings, would have yielded the same total 
score and the same total amount of information. Stevens’s 
corrections have been applied to these figures, and the 
limits within which p probably lies (for a 5 per cent. 
significance level) are found to be 


2°12 per cent. and 6-42 per cent. 


(vi) Homogeneity of the data. For the purpose of obtaining a joint 
estimate of the recombination fraction, it has been assumed 
that the frequency of crossing-over in the male is the same 
as that in the female, and also that it is the same when 
A” or a segregates with a‘ as when A segregates with a’. 
These assumptions are not contradicted by the data; the 
heterogeneity x? is not more than 2-523, which corresponds 
with a probability exceeding 0-5 since it has four degrees 
of freedom. The contributions to this x? made by the 
backcross data alone amount to 1°835, which for one 
degree of freedom corresponds with a probability exceeding 
0-17; and the contributions made by the coupling inter- 
crosses amount to 0°589, which for one degree of freedom 
corresponds with a probability exceeding 0-3. 
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TABLE 2 





Coupling | Coupling _ Coupling Coupling Repulsion 





A and a’ | A” and at male female 
segregating) segregating’ 


intercrosses| intercrosses|backcrosses:|backcrosses:|intercr ‘OSSes 








Segregation at the undulated locus : 
Recessives observed . . 14 60 22 I 61 
a =e lf 11°50 60°75 23°50 13°00 55°25 
GEES Sa 0) Be 0°725 0-009 0°479 0°154 0-798 
Segregation at the agouti locus : 
Recessives observed . : ‘ 13 60 23 13 53 
‘ expected . , ; 11°50 60°75 23°50 13°00 55°25 
x? . . ; , ; ‘ 0-261 0-009 0005 0-000 0°122 






































classes of progeny. 2 indicates summation over these classes. n=JZa. 
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SUMMARY 


1. Data are given which indicate close linkage between the agouti 
locus in the house mouse and the locus of Wright’s mutant “ un- 
dulated.” Undulated therefore belongs to Linkage Group V. Its 
position with respect to pallid, wellhaarig, kreisler and hydrocephalus-1 
is not known. 

2. The data are abstracted from the records at the Department of 
Genetics, Cambridge University ; they are unselected, except that to 

2A 


a dm 
m dp (score) . : - | —64°889 |—292-889 | —82-000 |—44:000 | —104°444 
2 

n z- (3) (information) . : 81-778 | 432:000 | 188-000 | 104-000 392°889 
Total score ‘ ; . : ; , : - — 588-222 
Total information ‘ ‘ ; , ; 1198-667 
Independence: x? . ‘ : ; . (- 588° ona) 130 667—= 288-658 

Estimating intensity (p = 0-0386) : 
dm 
nD ; ‘ ; . | —22°688 | —gr-o1g | +31°954 +26-850 —4°687 
di 2 | 
no (3) ly 183-702 | 6253°034 | 1266-502 700618 | 221-824 
| 
x? . ‘ ; , : ‘ | 0°435 O°154 0-806 1-029 0°099 | 
| 

Total score . ‘ , : i ; : : ; - +0°410 
Total information. ; : . 9625-680 
Estimate ofp . : - ; ca 0386-+-(0- 410-- 9625 680) 680) = 0°038643 
Standard error of estimate . ‘ 1+ 4/9625°680=o0-o10192 
Heterogeneity x? : : . O° 435-+0° 154-+0° 099+ 1:029+0°806 =2°523 


Note.—m is the expectation (in terms of »), and a is the observed number, in each of the phenotypic 
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ensure correct classification no mouse is included- which died before 
the age of fourteen days or which was also classified as fused ; albinos, 
being unclassifiable for agouti, are also excluded. The data are 
chiefly of the intercross type (coupling and repulsion) but some are 
of the backcross type. 

3. The results indicate good Mendelian segregation of undulated 
and of the agouti alleles involved. There is a highly significant inter- 
action between them, which leads (by Fisher’s scoring method) to an 
estimated recombination fraction = 3°86 per cent. with fiducial 
limits (for a 5 per cent. significance level) = 2-12 per cent. and 
p = 6-42 per cent. 
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Srxty years ago the first plant breeding station in Sweden was 
founded at the village of Svaléf in the southernmost district of Skane ; 
| to-day there are several important institutes, each with branch 

stations and sub-stations distributed over the country. Most of these 
institutes and stations are State supported, others are owned by private 
seed companies, some of which receive State aid for research. Plant 
breeding has therefore been taken seriously both by the State and by 
private enterprise. Its justification is to be found in the increased 

yields of crops over the last fifty years; in wheat, a 30 per cent. 
| increase has been estimated to be due to improved varieties alone. 
This fact, together with the comparatively small size but diverse 
nature of the country, makes the organisation one very suitable 
for study. 

Plant breeding and genetics that is controlled by the State is 
designed to meet three main requirements. (1) Research into new 
methods of breeding and fundamental knowledge. (2) Breeding 
improved varieties suited to many different conditions of soil and 
climate. (3) The efficient maintenance and distribution of the new 

varieties after they have been placed on the market. 

Fundamental research is carried out at the Institute of Genetics, 
University of Lund, under Prof. Miintzing, and at the Plant Breeding 
Institute at Svaléf under Prof. Akerman. The Institute of Genetics 
is also responsible for genetic teaching. These two Institutes have 
always been in very close touch, and in fact they were at first under one 
Director, Prof. Nilsson-Ehle. Even now the experimental grounds for 
the Genetic Institute are provided by the Plant Breeding Institute ; 
this dependence has its disadvantages which, however, will soon be 
removed by a decision to provide a new building with modern 
equipment in its own experimental grounds at Lund. This, and the 
proposed doubling of the staff are evidence of the importance attached 
to the fundamental and teaching aspects. 

Genetical teaching and research has also been carried out at the 
Agricultural College established at Ultuna near Uppsala under the 
direction of Prof. Turesson. This institute is also to be enlarged. 

The work, which has already been remarkably successful, is chiefly 
devoted to the production of new crops by acclimatisation, hybridisa- 
tion and the doubling of chromosome numbers. 
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Although the main function of the Plant Breeding Institute at 
Svaléf is the breeding of new varieties of agricultural crops, funda- 
mental work and the search for possible new methods of breeding 
are encouraged. Of this type of work the X-ray changes induced 
in barley by Dr Gustafsson have already given results of importance 
to practical plant breeding ; while the work of Dr Levan on the 
effects of colchicine and other polyploid-producing agents on the 
nucleus have given polyploids which show promising practical results. 

The second objective, the breeding of improved varieties suitable 
for environments, is done at the main institute at Svaléf in conjunction 
with the branch stations, which are distributed over the country 
according to the map, fig. 1. The co-operation of the main institute 
with the branch stations is a very important factor in the efficient 
use of the breeding material. 

Breeding and initial selection are done mainly at Svaléf, while 
in later generations testing and selection are done both at the main 
station and at the branch stations. In this way, under the varied 
conditions of the branch stations, the full potentialities of the crosses 
can be exploited so that varieties which may be suitable for particular 
local conditions would not be lost. For plants which are confined to 
a light soil, it has been found that even the initial breeding work 
must be done on such soil, and for this purpose the out-station at 
Ugerup is used. 

In the course of the breeding work it has been found that, with 
some plants such as wheat, oats and timothy grass, a single variety 
can be produced which is very suitable over large areas of cultivation. 
Such versatile varieties, however, have not been found in many other 
crops, and, with these, it has been necessary to produce several 
varieties to suit local conditions. The drawback with such material 
is that the multiplicity of varieties entails an increase in the cost of 
maintaining the élite and stock seeds. 

Recently, the policy of doing the breeding work at the main 
station has been modified. With a highly specialised crop that has 
to fulfil the requirements of a complex industrial process it may be 
necessary to set up a breeding station to deal only with this crop. 
Flax, for which there has recently been a great demand owing to the 
shortage of imported textiles, is bred separately from other crops at 
Mansabo near Svaléf under the directorship of Dr Grenhall. 

The third requirement, the maintenance and distribution of 
varieties produced at the Institute, is a difficult problem, as in other 
countries. It has been solved, at least for seed crops in Sweden, in this 
way. The main breeding Institute at Svaléf and the branch stations 
are owned by the Swedish Seed Association, which was formed in 
1892—actual breeding having begun in 1886—to encourage the use 
of improved varieties of agricultural crops. This common control of 
the main and branch stations ensures their full co-operation. The 
association, however, does not distribute the new varieties ; this is 
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done by a private company, the General Swedish Seed Company, 
formed specially for the purpose. It is run on orthodox business lines 
but has certain obligations to the Association. The Company has the 
sole right to all new varieties bred by the Association, for which a 
fixed royalty and a percentage of the sales are paid to the breeding 
Association ; and the Company agrees to the control of its business 
partly by the Central Government Seed Testing Station. The seed 
company keeps and distributes the stock seed but the Breeding 
Institute produces and controls the élite seed. Finally, the Company 
has no control over the work of the Breeding Institute. 

This system seems to work very satisfactorily in Sweden, and 
furthermore, it is acceptable to most of the rival seed firms because, 
as will be shown later, these private firms have their own departments 
of genetical research. It could not be expected to work smoothly in 
a country where seed firms do not have their own trained plant 
breeders. Under such conditions the system adopted in Denmark is 
probably the best. Here all the seed firms contribute to a society 
which carries out the breeding work and maintains the purity of the 
élite seed ; each seed firm benefits by growing and distributing the 
bulk seed of any new product of research. Thus with this method 
the seed grower and the breeder are organised to their mutual benefit. 

Both methods, the Swedish and the Danish, reduce the un- 
necessary replication of varieties of similar merit and type to a 
minimum, and they also discourage the pernicious practice of renaming 
an already existing variety to give the impression that it is something 
new. This is obviously true of the Danish co-operative system, and 
it also applies to the Swedish system, because the new products of both 
the State Breeding Institutes and the private seed firms are rigorously 
tested at Government stations before they are marketed. 

With breeding that is carried out by private seed companies the 
problem of seed distribution does not arise. In Sweden one of the 
largest of such breeding stations is at Weibullsholm near Landskrona 
on the southern point of the West Coast. This is owned by the firm 
of W. Weibull & Co., which originated in 1870 as a small seed farm. 
The Company has developed enormously, and in 1907 the first 
geneticist was appointed to direct the breeding work. At the present 
time the breeding station of this firm is under Dr Lamprecht and is 
organised into five departments, each dealing with different crops, 
including all cereals, fodder plants, vegetables and flowers. Many 
valuable varieties of plants have originated from this station, and at 
present the varieties of wheat produced there are among the best in 
Sweden. Apart from the trials made at Weibullsholm, all new 
varieties are finally tested for yield at the Government testing 
station. 

Close by, at Hilleshég, is another large private breeding station 
owned by the Swedish Sugar Manufacturing Company and directed 
by Dr Rasmusson. Its work is confined to sugar-beet ; the strains 
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of beet that have been produced there are extremely successful in 
competition with other strains both in Sweden and in other countries. 

The work of selection and isolation is carried out at a number 
of branch stations. One important result of selecting in different 
environments has been found in sugar-beet. Starting with somewhat 
heterozygous material, selection for such polygenic characters as high 
sugar content and low noxious nitrogen is made on plants grown at 
a number of different places. The basis of selection is the same in all 
the localities, and the end performance of the lines is similar. Genetic- 
ally, however, the end-result is quite different. This is shown by the 
heterosis in the F,, which is found when these selected lines from 
different localities are crossed. Thus, in an outbreeding plant like 
sugar-beet, selecting a reduced number of phenotypes in one 
environment leads to greater homozygosity of the particular polygenic 
complexes selected. The same basis of selection in a different 
environment also leads to more homozygosity but of different com- 
plexes. Intercrossing the two selections would lead to heterosis. 
The practical importance of such a method is self-evident, and steps 
are being taken to exploit this commercially. 

Triploid and tetraploid sugar-beet have given promising results 
both at Hilleshég and Svaléf. 

Other private firms engaged in breeding are those of O. J. Olson 
at Hammenhég and Algot Holmberg & Sons at Norrképing ; these 
are notable for root and herbage plants and for their vegetable and 
flower seeds. Although the breeding of vegetables is carried out by 
a number of private firms, the State also provides for breeding in 
these crops at a department of the Horticultural Research Station at 
Melangard, Alnarp, under Dr Lamm. 

The success of these privately-owned breeding stations is probably 
due to two main factors: (1) A trained staff under a scientist is 
employed, (2) the companies are large and can afford the capital 
outlay and continued expense necessary for breeding work on an 
adequate scale. Wiebullsholm is owned by a large single company, 
while Hilleshég station is owned by a combine of all sugar manu- 
facturers in Sweden. 

To turn to a different class of plants, breeding has recently been 
started in forest and fruit trees. Early in 1936 the Association of 
Forest Tree Breeding was founded under the directorship of Prof. 
Sylvén ; later in the same year the Association was supported by the 
Forest Owners’ Society and, in 1937, by the Swedish Match Company. 
In 1938 the sum of 400,500 crowns was given by the Knut and Alice 
Wallenberg Foundation to purchase the grounds and equipment for 
the Forest Tree Breeding Station at Ekebo near Svaléf. Four branch 
stations have since been added. 

The Association started the work by making an inventory of 
Swedish forests and by laying down, under varied conditions, trials of 
material collected from the whole country. In the breeding work 
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that followed, all the techniques of modern genetics have been 
applied ; these include hybrid seed production, statistical design 
of trials, artificial illumination for winter flowering and colchicine 
treatment. Special methods, which are necessary for the ease of 
handling, have been devised ; one of these is the grafting of flowering 
shoots on to small stock trees, thus making a small flowering tree. . 

Already it is possible to generalise from the work ; for example, 
in conifers, hybrid vigour between diploid strains shows good results 
while polyploidy is of doubtful value ; in deciduous trees on the 
other hand, a marked improvement can be effected with polyploidy. 
Thus triploid forms of ash and poplar, found among wild populations, 
have a growth rate nearly twice that of the diploid ; triploids obtained 
in oak from twin acorns have a similar advantage. Perhaps of equal 
importance is the discovery of a general correlation between the 
growth and performance of young seedlings and the value of the 
mature trees, thus giving a basis for selection in the young seedlings. 

In 1938, when the State Horticultural Research Stations were set 
up at Alnarp and Balsgard under Prof. Nillson, fruit breeding was 
put on a national basis. Before this, however, Prof. Dahl and 
Dr Johansson had been breeding fruit at the Horticultural College 
at Alnarp. Improvement of soft and tree fruits is in progress, and 
already a result has been obtained which may be a starting-point of a 
whole series of new apple varieties. This is the production of tetraploid 
apples by Prof. Ehle’s method of raising many seedlings from the 
already existing triploid varieties. A small percentage of the progeny 
are tetraploids, of which some have fruited with promising results. 

Before new varieties of fruit are put on the market they are tested 
at two State branch stations at Ranna, east of Gothenburg and at 
Nychelby near Stockholm, and also at schools and privately-owned 
fruit farms. The schools and growers record crop weights, season of 
ripening, incidence of disease and other characters of economic value ; 
in return for this the trees are supplied free and, apart from small 
samples of fruit, the crop belongs to the grower. This system works 
well, particularly with the schools, mainly because their agricultural 
teaching is suited to this kind of work. 

Proved varieties are propagated and sold by a State Horticultural 
College, which has a commercial fruit department ; this method is 
similar to the distribution of agricultural seed produced by the Svaléf 
station and it avoids the difficulty of favouring any one commercial 
firm. 

An encouraging sign to the tree breeder is the speed with which 
new polyploid and hybrid forest and fruit trees have been propagated. 
The enterprising nursery firm of Holgar Jensen of Ramlésa already 
grows several new forest trees; this firm also does useful breeding 
work with Rubus and Fragaria. 

Again, as in the agricultural crops, the fruit breeding is supported 
mainly by the State and, to a less extent, by private subscription, 
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but the whole work gains much by the genuine interest taken in it 
throughout the country. The fact that the early breeding in cereals 
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paid handsomely has naturally been a great incentive to extend such 
work to long-term forest and fruit crops. 














A MAP OF SWEDEN SHOWING THE DISTRIBUTION CENTRES 
OF RESEARCH IN PLANT BREEDING 


GOVERNMENT INSTITUTES 


Teaching and Research 


O Institute of Genetics, University of Lund . ; ‘ “ - Prof. A. Miintzing 
© Agricultural College, Ultuna, Uppsala... ‘ ; ‘ - Prof. G. Turesson 
Breeding and Research 
Agricultural Crops 
A Plant Breeding Institute, Svaléf . ° . Prof. A. Akerman 

A Branch Stations of the Swedish Seed Association 

Fruit and Forest Trees, Horticultural Crops 
@ State Horticultural Research Station, Alnarp, Malmé ‘ - Prof. F. Nillson 
@ State Horticultural Research Station, Balsgard, Kristienstad - Prof. F, Nillson 
@ Forest Tree Breeding Station, Ekebo , ‘ ‘ ° - Prof. Nils Sylvén 
PRIVATE SEED COMPANIES (some receiving State aid) 

@ W. Weibull & Co., Weibullsholm, Landskrona - Dr H. Lamprecht 
@ Swedish Sugar Beet Breeding Institute, acres Landskrona - Dr J. Rasmusson 
@ O.J. Olson & Co., Hammenhég: . - DrS. Undenas 
@ Algot Holmberg & Sons, Norrképing , ; ‘ ; - MrA. Holmberg 


Note-—Wiad, Eldtomta, near Stockholm, is the site of the Animal Breeding Institute : 
Director, Prof. Gert Bonnier. Here are conducted the important investigations on 
twin cattle, practica) breeding of sheep and poultry, and pure researches on 
Drosophila genetics. 
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REVIEW 


GENETICS. By Edgar Altenburg. London: Constable, 1947. Pp. xii+452. 16s. net. 


When Bateson was writing Mendel’s Principles of Heredity it was still 
possible to cover the whole of genetics, both observations and theory, in 
one account. This possibility vanished, however, almost immediately 
after the appearance of Bateson’s book in 1g0g. Since that time it has 
always been necessary to select the material which is to be included in any 
book on genetics. 

The choice of material depends, of course, primarily on the aim of the 
book. This may be to bring together all the observations of a particular 
kind, such as chromosome numbers, gene segregations or technical methods. 
Or it may be to present a detailed consideration of both experiment and 
theory in some particular and limited branch of the subject. Or it may be 
to present students with a textbook which covers all branches of the subject, 
but treats each of them only to a limited extent. 

Dr Altenburg’s Genetics is of this last kind. He aims, so he tells us, at 
giving the student an account of genetics as a growing science, of genetics 
as it is concerned with evolution, development and cell chemistry as well 
as in its more mechanical aspects. The structure of the book, however, 
reflects the author’s training as a zoologist, and experience as a member of 
Morgan’s early Drosophila laboratory, as much as it does his avowed aim. 

Chromosomes are introduced right at the start, which is in fact an account 
of mitosis and meiosis. Then follows what comes to seem almost like a 
digression, into the inter-relations of heredity, environment and character, 
before the main trend is resumed with an account of mendelian behaviour. 
This is elaborated and extended in relation to multiple factor inheritance, 
sex determination, inbreeding and outbreeding, and selection. Linkage 
is introduced at this point and related to meiosis. Next follow multiple 
allelomorphs, structural change, especially as it is concerned with cyto- 
genetic chromosome maps, polyploidy and polysomy. Turning to mutations, 
their artificial production is discussed as well as their natural behaviour. 
Together with an account of balanced lethals and complex hybridity, all 
this takes up eighteen chapters. The book is then rounded off with 
a single chapter on heredity and development, including cytoplasmic 
behaviour, and another on the genetical basis of evolution. 

That the dismissal of development and evolution in their genetical 
aspects with but two chapters out of twenty between them, represents an 
unbalance of treatment, few will probably deny. The cause from which 
it springs, however, may be less readily admitted. Altenburg’s treatment 
of any phenomenon or theory appears to have been governed largely by 
the confidence which he felt in the relevant experimental evidence and its 
interpretation, rather than by the potential importance of the subject 
itself. At first sight this might seem a desirable rule in any text designed 
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for students. It is nevertheless one which, if followed at all closely, must 
inevitably lead to distortion. Thus, for example, the behaviour of ring 
chromosomes in Drosophila melanogaster receives, and indeed on this rule 
must have received in January 1945 when the book was written, almost 
as much attention as does cytoplasmic effects in animals. When, 
however, the importance is remembered of the cytoplasm as the nucleus’ 
agent in development, such a treatment must seem cursory even at that 
date. More recent developments in genetics serve only to emphasise this 
point. 

In the same way extra-nuclear inheritance in plants is referred entirely 
to the plastids, whose distribution from cell to cell and transmission from 
mother to offspring can be followed by the same microscopic method which 
serves for chromosomes. The cytoplasmic element in the inheritance of 
male-sterility in maize, flax or )icotiana may not indeed be formally 
separable from the plastids; but neither is plastid inheritance formally 
distinguishable from that of other cytoplasmic elements. We must recognise 
that reference of male sterility to plastids is made much less reasonable 
than its reference to other cytoplasmic constituents by considerations 
beyond those of mere hereditary transmission. We need, in fact, something 
more than the approach of formal genetics, whether it is to the task, which 
may seem trivial to a zoologist, of describing extra-nuclear inheritance in 
plants ; or to that, which few will regard as trivial, of seeing genetics in 
the broader setting at which Altenburg avowedly aims. 

The treatment of evolution is similarly limited. Natural selection is 
discussed in its formal aspects, but no attempt is made to use it in the 
interpretation of those broad similarities and differences which are so 
important to the understanding of how genetic systems determine the course 
of evolutionary change and are, in their turn, determined by it. Indeed, 
breeding systems, whose importance in relation to variation and selection 
was at least partly understood by Darwin, receive no greater mention than 
does Darwin himself. His name is not to be found in the index. 

The full appreciation of the inter-relations of genetics and natural 
selection, of Mendelism and Darwinism, requires of course a consideration 
of plants equally with animals ; and it may be that Altenburg’s treatment 
has suffered for this reason. Certainly he markedly subordinates plants to 
animals, and at times his discussion suggests an utter unfamiliarity with 
plants and plant genetics. Some of us will be surprised to learn, for 
example, that “ the corolla (is) the trumpet-shaped part of a flower, consisting 
of the fused petals.” We may even venture to think that students to whom 
such an explanation is necessary would be better employed in obtaining some 
further training in elementary biology before taking up genetics. We can 
‘only conclude that the plant kingdom is foreign territory to the author, and 
this conclusion will be strengthened, to take another example, by his use of 
the germ cells as including not only the gametes but also all cells ancestral 
to them. Such a usage, whether it is desirable or not in animals, would in 
plants reduce the term to a farce ; for even in many of the flowering plants 
almost every cell is potentially a germ cell in this sense. 

These lapses, of which a few examples have been cited, are made the 
more dangerous by the dogmatic style of writing. Some dogmatism is 
scarcely to be avoided in a student’s book, but the arts of persuasion are 
not wholly out of place even in writing such a text. 
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The book will have its uses: that is ensured by the provision of so 
much information about genetical method and so many problems for 
exercise. It cannot, however, be regarded as attaining its author’s object. 
In writing it he has not managed to strike the delicate balance between the 
narrowly formal and the broadly comprehensive, between the proven fact 
and the likely inference, which the presentation of genetics as the living 
heart of so much biological theory must demand. 


K. MATHER 
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ABSTRACTS of Papers read at the NINETY-FOURTH MEETING of the 
Society, held at 2 p.m. on WEDNESDAY, 10th DECEMBER, 1947, in 
the Rooms of the LINNEAN SOCIETY, Burlington House, Piccadilly, 
London, W.|I. 


THE NUCLEOLUS AND NUCLEOLAR ORGANISERS 
IN SPIROGYRA 


M. B. E. GODWARD 
Botany Department, Queen Mary College, London 


The satellite chromosomes of Spirogyra spp. hitherto found only in S. crassa 
are shown to be the nucleolar-organising chromosomes, the nucleolar organising 
region being apparently the satellite stalk itself. In the resting nucleus the structures 
representing the satellite stalks are contained within the nucleolus in a loosely 
coiled form, each stalk being enclosed throughout its length in a specially 
differentiated tube of nucleolar material which is termed the nucleolar tubule. 
The coiled nucleolar tubules stain densely while the background material of the 
nucleolus stains less densely. During the breakdown of the nucleolus in prophase, 
a characteristic stage, where the contracted nucleolar tubule alone remains around 
the satellite stalk, is reached. All the nucleolar material has been dispersed at 
metaphase, but varying amounts of stainable material (according to the species) 
which may be derived from the nucleolus, have simultaneously appeared throughout 
the nucleus. According to the species, some of this material is more or less con- 
centrated around the metaphase plate. In one species, some of this material precedes 
the chromatids to the poles and has accumulated there as globules at a time when 
the chromatids are only halfway between equator and poles. 

Stainable material is sloughed off the chromosomes at telophase and globules 
arise in the daughter nucleus. The nucleoli are reorganised on the satellite stalks 
and simultaneously the globules generally all disappear. On occasion one, two or 
three are left and persist in the resting nucleus (the Binnenkérper and Nebenkérper 
of the older authors). 

Heterochromatic staining is shown by “‘ chromocentres ” and alternate blocks 
along the length of the prophase chromosomes, also by the satellite stalks at meta- 
phase. There are only indications of separate chromatids in prophase ; none at 
metaphase when the chromosomes have a sinuous shape and are without distinc- 
tion of right and left arms. Evidence of the existence of matrix and the question 
of centromeres in Spirogyra is discussed. 

The historical controversy between Van Wisselingh and Geitler on the origin of 
chromosomes from the nucleolus in Spirogyra is resolved, with the confirmation 
of the visual observations of Van Wisselingh (1898) and of the conclusions of Geitler, 
except in regard to the satellite stalks. 


THE B-CHROMOSOMES OF PARTHENIUM ARGENTATUM 


D. G. CATCHESIDE 
Botany School, Cambridge 


The B-chromosomes of P. argentatum are heterochromatic, remaining condensed 
in pollen mother cells until diakinesis. Prior to metaphase I they commence to 
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lose stainability and, pari passu, to uncoil until at late’ anaphase I they are long 
slender threads. Association, presumably by chiasmata, occurs between them to 
a moderate degree, but some of this appears to lapse between diakinesis and 
metaphase I at which stage univalent B-chromosomes are the rule. At anaphase I, 
the B-chromosomes lag behind the normal ones and the univalent ones tend to 
proceed together to one pole rather than at random. 


SUBDIVISIONS OF THE BLOOD GROUPS MN IN MAN: LINKED 
LOCI OR MULTIPLE ALLELOMORPHS ? 


RUTH SANGER and R. R. RACE 


Medical Research Council, Blood Group Research Unit, 
The Lister Institute, London 


In comparison with the ABO and Rh blood groups, the MN groups have 
appeared singularly uncomplicated since their discovery in 1927 by Landsteiner and 
Levine. A serum, sent from Australia, containing a “‘ new” agglutinin has been 
found to disclose subdivisions within the MN system. 

This antibody agglutinates 72 per cent. of MM, 60 per cent. of MN and 
33 per cent. of NN bloods; such bloods are designated by the addition of the 
symbol S. 

Statistical analysis is consistent with the hypothesis that there are either four 
allelomorphs MS, M, NS and N, or that S is a gene closely linked with the MN 
locus. The gene frequencies are MS 25°05 per cent., M 26:27 per cent., NS 
9°21 per cent. and N 39:47 per cent. Expected phenotype frequencies calculated 
from these figures agree closely with those observed. The results of family investiga- 
tions lend further support to the interpretation ; S is shown to segregate with M 
in some families, with N in others. 
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